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COMPOSITIONS AND METHODS FOR THE THERAPEUTIC USE 
OF AN ATONAL-ASSOCIATED SEQUENCE FOR A GASTROINTESTINAL 

CONDITION 

[0001] The work herein was supported by grants from the United States 
Government. The United States Government may have certain rights in the invention. 

[0002] This application claims priority to a provisional application Serial No. 
60/137,060 filed June 1, 1999; a second provisional application Serial No. 60/176,993 filed 
January 19, 2000; and a nonprovisional application Serial No. 09/585,645, filed Jime 1, 2000. 

FIELD OF THE INVENTION 

[0003] The present invention relates in general to the field of genetic diagnosis 
and therapy and, more particularly, to the characterization and use of an atonaZ-associated 
nucleic acid or amino acid sequence, or any of its homologs or orthologs, as a therapeutic 
agent for the treatment of a gastrointestinal condition. 

BACKGROUND OF THE INVENTION 

[0004] An intricate pattern of interactions within and between cells directs the 
sequential development of neurons from dividing neuroepithelial progenitor cells. Multiple 
extracellular and intracellular signals moderate this process. Among the key intracellular 
signals are transcription factors, which induce the expression of a cascade of genes. One 
subclass of transcription factors, belonging to flie basic helix-loop-helix (bHLH) family of 
proteins, is expressed early on when the decision to proliferate or differentiate is made. This 
function is a particularly crucial one as mutations in these genes early in development can 
wipe out entire neural structures. 

[0005] In Drosophila, the gene atonal (ato), which is homologous to Mathl, 
Math2, Hathl and Hath2, encodes a bHLH protein essential for the development of 
chordotonal organs (sensory organs found in the body wall, joints and antenna that function 
in proprioception, balance and audition) (Eberl, 1999; Mclver, 1985; van Staaden and Remer, 
1998). CHOs populate the peripheral nervous system (PNS) in the body wall and joints 
(thorax, abdomen, sternum, wings, legs) and antennae (Moulins, 1976), providing the fly with 
sensory information much as touch and mechanoreceptors do in vertebrates (Mclver, 1985; 
Moulins, 1976). Boyan (Boyan, 1993) proposed that, in the course of evolution, different 
CHOs became specialized for hearing in different insects. This hypothesis was recently 
confirmed by van Staaden and Romer (1998). In Drosophila, CHOs in the Johnston organ. 
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located in the second antennal segment, function in near field hearing (Dreller and Kirschner, 
1993; Eberl, 1999) and negative geotaxis. 

[0006] During development ato is expressed in a cluster of progenitor cells fiom 
which the CHO founder cells are selected (Jarman et aL, 1993). It likely fimctions by 
regulating the expression of genes necessary for the specification and development of the 
CHO lineage; as it encodes a basic helix-loop-helix protein (bHLH) that dimerizes with the 
Daughterless protein and binds to E-box sequences, thereby activating genes (Jarman et al, 
1993). CHO specificity is encoded by the ato basic domain, which is required for DNA 
binding in bHLH proteins (Chien et a/., 1996; Davis et al, 1990; Jarman and Ahmed, 1998; 
Vaessin et al., 1990). ato is both necessary and sufficient for the generation of CHOs in the 
fly: loss of ato function leads to the loss of CHOs, while ectopic ato expression causes 
ectopic CHO formation (Jarman et al, 1993). Adult flies that lack atonal fimction are 
uncoordinated, do not fly, and are deficient in hearing. Overexpression of the fly atonal gene 
can generate new chordotonal neurons, indicating that atonal is both essential and sufficient 
for the development of this neuronal population. 

[0007] In vertebrates, during myogenesis and neurogenesis, cell fate specification 
requires basis helix-ioop-helix (bHLH) transcription factors. Mathl (for mouse atonal 
homolog-1) is such a factor, and is expressed in the hindbrain, dorsal spinal cord, external 
germinal layer of the cerebellum, gut, joints, ear and Merkel cells of the skin (which fimction 
as mechanoreceptors) (Akazawa et a/.,1995; Ben-Arie et al, 1996; BenrArie et al, 1997). 
Mice heterozygous for a targeted deletion Mathl {Mathl*'') are viable and appear normal, 
but Mathl null mice {Math''') die shortly after birth and lack cerebellar granule neurons. 

[0008] Mathl is one of ato's closest known homologs, with 82% amino acid 
similarity in the bHLH domain and 100% conservation of the basic domain that determines 
target recognition specificity (Ben-Arie et al, 1996; Chien et al, 1996). Mathl is transiently 
expressed in the CNS starting at embryonic day 9 (E9) in the dorsal portion of the neural 
tube. Mathl is also expressed in the rhombic lip of the fourth ventricle of the brain, where 
cerebellar granule cell precursors are bom at E13-15 (Alder et al, 1996). Upon proliferation 
and differentiation, these progenitor cells migrate to form the external granule layer (EGL) of 
the cerebellar primordia (Hatten and Heintz, 1995). Proliferating EGL cells continue to 
express Mathl during the first three postnatal weeks, until shortly before they migrate to their 
final adult destination to generate the internal granule layer (IGL) of the cerebellum 
(Akazawa et al, 1995; Ben-Arie et al, 1996). Another group of cells, a small population of 
neuronal precursors in the dorsal spinal cord, expresses Mathl during E10-E14 (Akazawa et 
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al., 1995; Ben-Arie et aL, 1996). These precursor cells also express the LIM homeodomain 
proteins (LH2A and LH2B), markers of the Dl class of commissural intemeurons (Lee et aL, 
1998). Hehns and Johnson (1998) reported that lacZ expression under the control of Mathl 
regulatory elements reproduced Mathl expression pattems in the developing cerebellum and 
spinal cord, and demonstrated that Mathl is expressed in precursors that give rise to a 
subpopulation of dorsal commissural intemeurons. 

[0009] To determine the in vivo function of Mathl, the inventors generated mice 
{Mathl''') lacking the MATHl protein. This null mutation causes major cerebellar 
abnormalities: lack of granule cell proliferation and migration from the rhombic lip at E14.5, 
and absence of the entire EGL at birth (Ben-Arie et al, 1997). It is not clear whether the 
agenesis of cerebellar granule neurons is due to failure of progenitor specification or the cells' 
inability to proliferate and/or differentiate. Neonates cannot breathe and die shortly after 
birth, but there are no gross defects in any cranial nerves or brain stem nuclei that could 
explain respiratory failure. 

[00101 The fact that Mathl is expressed in the inner ear suggests that Mathl 
expression is necessary for the development of auditory or balance organs. The inner ear 
initially forms as a thickening of the ectoderm, termed the otic placode, between 
rhombomeres 5 and 6 in the hindbrain. The otic placode gives rise to neurons of the Vlllth 
cranial nerve and invaginates to become the otocyst, from which the inner ear will develop. 
The mature manunalian inner ear comprises one auditory organ, the cochlea, and five 
vestibular organs: the utricle, the saccule, and three semicircular canals. The sensory 
epithelia of these organs consist of mechanoreceptive hair cells, supporting cells and nerve 
endings. Hair cells serve as mechanoreceptors for transducing sound waves and head motion 
into auditory and positional inforaiation. Hair cells and supporting cells both arise from a 
common progenitor cell and proliferate and differaitiate within the sensory epithelia, with 
peak mitoses between embryonic day 13 and 18 (E13-18) in mice. Alttiough several genes 
have been implicated in the development of the inner ear, such as int2 (Mansour et al, 1993; 
pax2 (Torres et al, 1996; and Hmx3 (Wang et al, 1998). None have been shown to be 
required for the genesis of hair cell specifically. 

[00111 Damage to hair cells is a conunon cause of deafiiess and vestibular 
dysfimction, which are themselves prevalent diseases. Over 28 million Americans have 
impaired hearing; vestibular disorders affect about one-quarter of the general population, and 
half of our elderly. The delicate hair cells are vuhierable to disease, aging, and environmental 
trauma (t.e., antibiotics, toxins, persistent loud noise). Once these cells are destroyed, they 
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cannot regenerate in mammals. Therefore, a need exists to address the problems of patients 
with congenital, chronic or acquired degenerative hearing impairment and loss or balance 
problems, and to provide compositions, methods and reagents for use in treating hearing loss 

and vestibular function. 

[0012] In support of the teaching of the present invention, others have 
demonstrated that Mathl, upon overexpression, induces significant production of extra hair 
cells in postnatal rat inner ears (Zheng and Gao, 2000). Briefly, although fate determination 
is usually completed by birth for mammalian cochlear hair cells, ovwexpression of Mathl in 
postnatal rat cochlear explant cultures results in additional ear hair cells which derive fi-om 
columnar epithelial cells located outside the sensory epithelium in the greater epithelial ridge. 
Furthermore, conversion of postnatal utricular supporting cells into hair cells is faciUtated by 
Mathl expression. The ability of Mathl to permit production of hair cells in the ear is strong 
evidence in support of the claimed invention. 

[0013] In addition to Mathl being involved in governing differentiation of 
neuronal cells, including sensory cells in the inner ear, Mathl is involved in intestinal 
development, as described herein. The mouse gut begins developing at embryonic day 7.5 
(E7.5). Invagination of the most anterior and posterior endoderm leads to the fomiation of the 
foregut and hindgut pockets, respectively, which extend toward each other and fuse to form 
the gut tube. By El 5.5, the gut appears as a pooriy differentiated, pseudostratified epithelium. 
From El 5.5 to E19, nascent villi with a monolayer of epithelial cells develop in a duodenum- 
to-colon pattern. During the first two postnatal weeks, the intervillus epithelium, where 
proliferating and less differentiated cells reside, develops into the crypts of Lieberkiihn. Stem 
cells in the intervillus epithelium (during embryogenesis) or crypts (in adulthood) give rise to 
four principle cell types: absorptive enterocytes or columnar cells, mucous-secreting goblet 
cells, regulatory peptide-secreting enteroendocrine cells in the large and small intestines, and 
antimicrobial peptide-secreting Paneth cells in the small intestine only. Enterocytic, goblet, 
and enteroendocrine cells continue to differentiate and mature while migrating up the villus, 
and are finally extruded into the lumen at the tip. This journey takes about 2 to 3 days. The 
Paneth cells migrate downward and reside at the base of the crypt for 21 days before being 
cleared by phagocytosis (Cheng and Leblond, 1974; Gordon et al, 1992; Back et al., 2000). 

[0014] The epithelial-mesenchymal interaction has been shown to be critical in 
the pioxunal-distal, crypt-villus patterning during gut development. A number of signaling 
molecules and transcription factors are involved in these processes (Kaestner et al., 1997; 
Pabst et al, 1999; Beck et al, 2000; Clatworthy and Subramanian, 2001). Previous studies 
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have suggested that all four epithelial cell lineages originate from a common ancestor (Cheng 
and Leblond, 1974; Gordon et al., 1992; Back et al., 2000; Bjerknes and Cheng, 1999), but 
the mechanisms that control the epithelial lineage differentiation are not well understood. T 
cell factor-4 (Tcf4) plays a role in the stem cell maintenance m the small intestine but does 
not induce epithelial cells to differentiate into enterocytes or goblet cells (Korlinek et al., 
1998). The present invention is directed to methods and compositions utilizing Mathl, 
particularly in gut development, as it is expressed in the gut (Akazawa et al., 1995) in 
addition to being involved in cell fate determination in the nervous system (Ben-Arie et al, 
2000; Betmingham et al, 2001). 

SUMMARY OF THE INVENTION 

[00151 In one embodiment of the present invention there is an animal having a 
heterologous nucleic acid sequence replacing an allele of an o/onaZ-associated nucleic acid 
sequence under conditions wherein said heterologous sequence inactivates said allele. In a 
preferred embodiment said heterologous sequence is expressed under control of an atonal- 
associated regulatory sequence. In a specific embodiment both a/o/iaZ-associated alleles are 
replaced, hi an additional specific embodiment both a/ono/-associated alleles are replaced 
with nonidentical heterologous nucleic acid sequences. In an additional embodiment said 
animal has a detectable condition wherein said condition is selected from the group 
consisting of loss of hair cells, cerebellar granule neuron deficiencies, hearing impairment, 
imbalance, joint disease, osteoarthritis, abnormal proliferation of neoplastic neuroectodermal 
cells and formation of meduUoblastoma. Li another embodiment of the present invention said 
heterologous nucleic acid sequence is a reporter sequence selected from the group consisting 
of b-galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), 
neomycin, kanamycin, luciferase, p-glucuronidase and chloramphenicol transferase (CAT). 
In another specific embodiment said reporter sequence regulatable or is expressed in brain 
tissue, neural tissue, skin tissue, non-ossified cartilage cells, joint chondrocytes, Merkel cells, 
iimer ear sensory epithelia and brain stem nuclei. In additional specific embodiments said 
atonoZ-associated allele is replaced with an a/o/ia/-associated nucleic acid sequence under 
control of a regulatable promoter sequence or a tissue-specific promoter sequence wherein 
said tissue is selected from the group consisting of brain tissue, neural tissue, skin tissue, non- 
ossified cartilage cells, joint chondrocytes, Merkel cells, inner ear sensory epithelia and brain 
stem nuclei. In additional embodiments said animal is a mouse, Drosophila, zebrafish, frog, 
rat, hamster or guinea pig. 
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(00161 In another embodiment of the present invention is a method for screening 
for a compound in an animal, wherein said compound affects expression of an atonal- 
associated nucleic acid sequence comprising delivering said compound to said animal 
wherein said animal has at least one allele of an otonaZ-associated nucleic acid sequence 
inactivated by insertion of a heterologous nucleic acid sequence wherein said heterologous 
nucleic acid sequence is under control of an atowaZ-associated regulatory sequence, and 
monitoring for a change in said expression of said atonal-associated nucleic acid sequence. In 
specific embodiments said compound upregulates or downregulates said expression of an 
fltowoZ-associated nucleic acid sequence. In additional embodiments said animal is a mouse 
or Drosophila. In a specific embodiment the heterologous nucleic acid sequence is a reporter 
sequence. In an additional specific embodiment the heterologous nucleic acid sequence is 
selected from the group consisting of p-galactosidase, green fluorescent protein (GFP), blue 
fluorescent protein (BFP), neomycin, kanamycin, luciferase, p-glucuronidase and 
chloramphenicol transferase (CAT). 

[00171 Another embodiment of the present invention is a compound which affects 
expression of an oto/ioZ-associated nucleic acid sequence. In specific embodiments said 
compound upregulates or downregulates expression of an atowaZ-associated nucleic acid 
sequence. In a specific embodiment said compound affects a detectable condition in an 
animal wherein said condition is selected fi-om the group consisting of loss of hair cells, 
cerebellar granule neuron deficiencies, hearing impairment, an imbalance disorder, joint 
disease, osteoarthritis, abnormal proliferation of neoplastic neuroectodermal cells and 
formation of meduUoblastoma. 

[0018] Another embodiment of the present invention is a method for screening for 
a compound in an animal, wherein said compound affects a detectable condition in said 
animal, comprising delivering said compound to said animal wherein at least one allele of an 
flto/iaZ-associated nucleic acid sequence in said animal is inactivated by insertion of a 
heterologous nucleic acid sequence, wherein said heterologous nucleic acid sequence is under 
the control of an a/o/jaZ-associated regulatory sequence, and monitoring said animal for a 
change in the detectable condition. In a specific embodiment said detectable condition is 
selected from the group consisting of loss of hair cells, cerebellar granule neuron 
deficiencies, hearing impairment, an imbalance disorder, joint disease, osteoarthritis, 
abnomial proliferation of neoplastic neuroectodennal cells and formation of 
meduUoblastoma. In another embodiment said delivery of said compound affects expression 
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of said heterologous nucleic acid sequence. In specific embodiments said expression of said 
heterologous nucleic acid sequence is upregulated or downregulated. In additional specific 
embodiments said animal is a mouse, Drosophila, zebrafish, &og, rat, hamster or guinea pig. 

10019] Another embodiment of the present invention is a compound wherein said 
compound affects said detectable condition. In specific embodiments said compound affects 
expression of a heterologous nucleic acid sequence. In additional specific embodiments said 
compound upregulates or downregulates expression of a heterologous nucleic acid sequence. 

[0020] In other embodiments of the present invention are methods of treating an 
animal, including a human, for cerebellar granule neuron deficiencies, for promoting 
mechanoreceptive cell growth, for generating hair cells, for treating hearing impairment or an 
imbalance disorder, for treating a joint disease, for treating for an abnormal proliferation of 
cells, and for treating for a disease that is a result of loss of fimctional fltonaZ-associated 
nucleic acid or amino acid sequence. Said methods include administering a therapeutically 
effective amount of an atonaZ-associated nucleic acid or amino acid sequence, hi specific 
embodiments said administration is by a vector selected from the group consisting of an 
adenoviral vector, a retroviral vector, an adeno-associated vector, a plasmid, or any other 
nucleic acid based vector, a liposome, a nucleic acid, a peptide, a lipid, a carbohydrate and a 
combination thereof of said vectors. In a specific embodiment said vector is a non-viral 
vector or a viral vector. In another specific embodiment said vector is a cell. In a preferred 
embodiment said vector is an adenovirus vector comprising a. cytomegaloviras IE promoter 
sequence and a SV40 early polyadenylation signal sequence. In another specific embodiment 
said cell is a human cell. In an additional specific embodiment said joint disease is 
osteoarthritis. In a specific embodiment said aro/ia/-associated nucleic acid or amino acid 
sequence is Hathl or Mathl. In another specific embodiment the cell contains an alteration 
in an ato/ia/-associated nucleic acid or amino acid sequence. In an additional specific 
embodiment said amino acid sequence has at least about 80% identity to about 20 contiguous 
amino acid residues of SEQ ID NO:58. In an additional specific embodiment the nucleic acid 
sequence encodes a polypeptide which has at least about 80% identity to about 20 contiguous 
amino acid residues of SEQ ID N0:S8. 

[0021] In another embodiment of the present invention is a method for treating an 
animal for an abnormal proliferation of cells comprising altering aro/ia/-associated nucleic 
acid or amino acid sequence levels in a cell. In a specific embodiment said alteration is 
reduction or said nucleic acid or amino acid sequence contains an alteration. 
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[0022] In another embodiment of the present invention is a composition 
comprising an a/owaZ-associated amino acid sequence or nucleic acid sequence in 
combination with a delivery vehicle wherein said vehicle delivers a therapeutically effective 
amount of an ato/ia/-associated nucleic acid sequence or amino acid sequence into a cell. In 
specific embodiments said vehicle is the receptor-binding domain of a bacterial toxin or any 
fusion molecule or is a protein transduction domain. In a specific embodiment said protein 
transduction domain is firom the HIV TAT peptide. In a specific embodiment said atonal- 
associated amino acid sequence or nucleic acid sequence is Hathl or Mathl. 

[0023] In another embodiment of the present invention there is a composition to 
treat an organism for loss of hair cells, wherein said organism comprises a defect in an 
atowflZ-associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said aro/jaZ-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said ato/ia/-associated nucleic acid sequence. In 
an additional embodiment of the present invention there is a composition to treat an organism 
for loss of hair cells, wherein said organism comprises defect in a nucleic acid sequence 
which is associated with regulation of an a/o/iaZ-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an organism 
for loss of hair cells, wherein said organism comprises a defect in an amino acid sequence 
which is associated with regulation of an a/o/ia/-associated nucleic acid sequence. 

[0024] In another embodiment of the present invention there is a composition to 
treat an organism for a cerebellar neuron deficiency, wherein said organism comprises a 
defect in an atonaZ-associated nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said a/onaZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said ato/iaZ-associated nucleic acid 
sequence. In an additional embodiment of the present invention there is a composition to 
treat an organism for a cerebellar neuron deficiency, wherein said organism comprises defect 
in a nucleic acid sequence which is associated with regulation of an a/onoZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a composition 
to treat an organism for a cerebellar neuron deficiency, wherein said organism comprises a 
defect in an amino acid sequence which is associated with regulation of an a/onaZ-associated 
nucleic acid sequence. 

[0025] In another embodiment of the present invention there is a composition to 
treat an organism for hearing impairment, wherein said organism comprises a defect in an 
flto/iflZ-associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
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alteration of said flto/ia/-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said atowaZ-associated nucleic acid sequence. In 
an additional embodiment of the present invention there is a composition to treat an organism 
for hearing impairment, wherein said organism comprises defect in a nucleic acid sequence 
which is associated with regulation of an atowaZ-associated nucleic acid sequence. In an 
additional embodiment of the present invention there is a composition to treat an organism 
for hearing impairment, wherein said organism comprises a defect in an amino acid sequence 
which is associated with regulation of an aro/iaZ-associated nucleic acid sequence. 

[0026] In another embodiment of the present invention there is a composition to 
treat an organism for imbalance, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said a/o«aZ-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said a/onaZ-associated nucleic acid sequence. In 
an additional embodiment of the present invention there is a composition to treat an organism 
for imbalance, wherein said organism comprises defect in a nucleic acid sequence which is 
associated with regulation of an ato/iaZ-associated nucleic acid sequence. In an additional 
embodiment of the present invention there is a composition to treat an organism for 
imbalance, wherein said organism comprises a defect in an amino acid sequence which is 
associated with regulation of an a/OAiaZ-associated nucleic acid sequence. 

- [0027] In another embodiment of the present invention there is a composition to 
treat an organism for osteoarthritis, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said aronaZ-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said aZonaZ-associated nucleic acid sequence. In 
an additional embodunent of the present invention there is a composition to treat an organism 
for osteoarthritis, wherein said organism comprises defect in a nucleic acid sequence which is 
associated with regulation of an a^oMaZ-associated nucleic acid sequence. In an additional 
embodiment of the present invention there is a composition to treat an organism for 
osteoarthritis, wherein said organism comprises a defect in an amino acid sequence which is 
associated with regulation of an atonaZ-associated nucleic acid sequence. 

[0028] In another embodiment of the present invention there is a composition to 
treat an organism for a joint disease, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said a/o/iaZ-associated nucleic acid sequence. In another specific embodiment 
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the defect affects a regulatory sequence of said atonaZ-associated nucleic acid sequence. In 
an additional embodiment of the present invention there is a composition to treat an organism 
for a joint disease, wherein said organism comprises defect in a nucleic acid sequence which 
is associated with regulation of an aro/iaZ-associated nucleic acid sequence. In an additional 
embodiment of the present invention there is a composition to treat an organism for a joint 
disease, wherein said organism comprises a defect in an amino acid sequence which is 
associated with regulation of an a/o/iaZ-associated nucleic acid sequence. 

[0029] In another embodiment of the present invention there is a composition to 
treat an organism for abnormal proliferation of cells, wherein said organism comprises a 
defect in an aro/ia/-associated nucleic acid sequence. In a specific embodiment the defect is a 
mutation or alteration of said ato/iaZ-associated nucleic acid sequence. In another specific 
embodiment the defect affects a regulatory sequence of said atonaZ-associated nucleic acid 
sequence. In an additional embodiment of the present invention there is a composition to 
treat an organism for abnormal proliferation of cells, wherein said organism comprises defect 
m a nucleic acid sequence which is associated with regulation of an ato/iaZ-associated nucleic 
acid sequence. In an additional embodiment of the present invention there is a composition 
to treat an organism for abnormal proliferation of cells, wherein said organism comprises a 
defect in an amino acid sequence which is associated with regulation of an atonaZ-associated 
nucleic acid sequence. 

- [0030] In another embodiment of the present invention there is a composition to 
treat an organism for cancer, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said a/owaZ-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said a/owaZ-associated nucleic acid sequence. In 
an additional embodiment of the present invention there is a composition to treat an organism 
for cancer, wherein said organism comprises defect in a nucleic acid sequence which is 
associated with regulation of an a/o/jaZ-associated nucleic acid sequence. In an additional 
embodiment of the present invention there is a composition to treat an organism for cancer, 
wherein said organism comprises a defect in an amino acid sequence which is associated with 
regulation of an a/o/iaZ-associated nucleic acid sequence. In a specific embodiment said 
cancer is meduUoblastoma. 

[0031] In an object of the present invention, there is a method of predicting a 
differentiation state for a stem cell, comprising the steps of obtaining the cell; and 
determining the expression status of an a^owaZ-associated sequence. In a specific 
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embodiment, the stem cell is an intestinal stem cell. In another specific embodiment, the 
stem cell is obtained from an intestinal epithelium. In an additional specific embodiment, the 
expression status of the ato/ia/-associated sequence is an upregulation of expression of the 
a/ona/-associated sequence. In another specific embodiment, the differentiation state is to a 
secretory cell of the intestine. In a specific embodiment, the secretory cell is at least one of a 
goblet cell, an enteroendocrine cell, or a Paneth cell. In an additional specific embodiment, 
the expression status of said fl/o«a/-associated sequence is a downregulation of expression of 
said ato/ia/-associated sequence. In another specific embodiment, the differentiation state is 
to an absorptive cell of the intestine. In a further specific embodiment, the flto«fl/-associated 
sequence is a polynucleotide. In another specific embodiment, the atonaZ-associated 
sequence is a polypeptide. 

[0032] In another object of the present invention, there is a method for 
differentiating a stem cell, comprising altering expression of an a/o/ifl/-associated sequence. 
In a specific embodiment, the stem cell is a gastrointestinal stem cell. In an additional 
specific embodiment, the stem cell differentiates into a secretory cell. In a specific 
embodiment, the secretory cell is at least one of a goblet cell, an enteroendocrine cell, or a 
Paneth cell. In a further specific embodiment, the stem cell differentiates into an absorptive 
cell. 

[0033] In an additional object of the present invention, there is a method of 
regenerating secretory intestinal cells in an individual, comprising the step of administering 
to the individual a stem cell and a regulatory factor for said stem cell, wherein the expression 
of an atonaZ-associated sequence is upregulated in the stem cell. In a specific embodiment, 
the secretory intestinal cell is at least one of a goblet cell, an enteroendocrine cell, or a Paneth 
cell. In another specific embodiment, the regulatory factor is a bone morphogenetic protein. 
In a specific embodiment, the bone morphogenetic protein is GDF7. 

[0034] In another object of the present invention, there is a method of 
regenerating absorptive intestinal cells in an individual, comprising the step of administering 
to the individual a stem cell and a regulatory factor for said stem cell, wherein the expression 
of an ato/ia/-associated sequence is downregulated in the stem cell. In a specific 
embodiment, the regulatory factor is a member of the HES family. 

[0035] In an additional object of the present invention there is a method of 
treating an animal for a gastrointestinal condition, comprising delivering to the animal a 
gastrointestinal stem cell. In a specific embodiment, the method fiurther comprises delivery of 
a regulatory factor. 



wo 03/047532 



PCT/US02/41458 



[0036] In another object of the present invention there is a method of treating an 
animal for a gastrointestinal condition comprising delivering a therapeutically effective 
amount of an ato«a/-associated amino acid sequence or nucleic acid sequence to a cell of said 
animal. In a specific embodiment, the gastrointestinal condition is cancer, damaged intestinal 
tissue, inflammatory bowel disease, irritable bowel syndrome, infection or necrotizing 
entercolitis. In a specific embodiment, the a/o«a/-associated amino acid sequence or nucleic 
acid sequence is Mathl. In another specific embodiment, the a/o/ia/-associated amino acid 
sequence or nucleic acid sequence is Hathl. In a further specific embodiment, the amino 
acid sequence or nucleic acid sequence is administered by a delivery vehicle. In a further 
specific embodiment, the delivery vehicle is an adenoviral vector, a retroviral vector, an 
adeno-associated viral vector, a plasmid, a liposome, a nucleic acid sequence, a peptide, a 
lipid, a carbohydrate or a combination thereof In an additional specific embodiment, the 
delivery vehicle is selected from the group consisting of a viral vector or a non-viral vector. 
In another specific embodiment, the cell contains an alteration in an ato/ia/-associated nucleic 
acid sequence or amino acid sequence. In an additional specific embodiment, the amino acid 
sequence has at least about 80% identity to about 20 contiguous amino acid residues of SEQ 
ID NO:58 {Hathl). In a fiirther specific embodiment, the nucleic acid sequence encodes a 
polypeptide which has at least about 80% identity to about 20 contiguous amino acid residues 
ofSEQIDNO:58 (Hathl). 

[0037] In another object of the present invention, there is a composition in a 
pharmaceutical carrier, comprising at least one stem cell, wherein the cell is upregulated for 
expression of an a/o/jfl/-associated sequence; and at least one regulatory factor. In a specific 
onbodiment, the stem cell is a gastrointestinal stem cell. 

[0038] In an additional specific embodiment, the composition in a pharmaceutical 
carrier, comprising at least one stem cell, wherein the cell is dov^mregulated for expression of 
an fltonaZ-associated sequence; and at least one regulatory factor. In a specific embodiment, 
the stem cell is a gastrointestinal stem cell. 

[0039] In another object of the present invention, there is a method of treating an 
individual for a gastrointestinal condition, comprising the step of administering to the 
individual a composition described herein. 

[0040] In an additional object of the present invention, there is a method for 
screening for a compound in an animal, wherein said compound affects a detectable 
gastrointestinal condition in said animal, comprising delivering the compound to said animal 
wherein at least one allele of an afonaZ-associated nucleic acid sequence in said animal is 
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inactivated by insertion of a heterologous nucleic acid sequence, wherein said heterologous 
nucleic acid sequence is under the control of an flto/ia/-associated regulatory sequence, and 
monitoring said animal for a change in the detectable gastrointestinal condition. In a specific 
embodiment, the delivery of the compound affects expression of the hetCTologous nucleic 
acid sequence. In a specific embodiment, the compound affects the detectable condition. 

10041] In another object of the present invention, there is a kit comprising an 
intestinal stem cell. In a specific embodiment, the kit further comprises a regulatory protein. 

[00421 In an additional object of the present invention, there is a method of 
treating an animal for a disease that is a result of loss of fimctional atonal-associated nucleic 
acid or amino acid sequence comprising delivering a therapeutically effective amount of an 
atonal-associated amino acid sequence or nucleic acid sequence to a cell of said animal. In a 
specific embodiment, the disease is a gastrointestinal disease. 

[0043] Other and fiirther objects, features and advantages would be apparent and 
eventually more readily understood by reading the following specification and by reference to 
the company drawing forming a part thereof, or any examples of the presently preferred 
embodiments of the invention are given for the purpose of the disclosure. 

DESCRIPTION OF THE DRAWINGS 

[0044]^ Figure 1 A and IB demonstrate that the inner ear p-Gal staining (blue) of 
Mathl heterozygous embryos as described hereinabove. Figure lA shows the otic vesicle 
(OV) at E12.5 and Figure IB the inner ear at E14.5 of Matht'^'^''^ embryos. Sensory 
epitiielia stained positively in the cochlea (C), saccule (S), utricle (U), and semicircular canal 
ampullae (SCA). A schematic diagram of the inner ear is depicted alongside the staining for 
reference, blue indicates location of the sensory epithelia. The original magnifications of the 
images taken under the microscope were xlOO for Figure 1 A and x50 for Figure IB. 

[0045] Figures 2A through 2F are scanning electron micrographs of E18.5 irmer 
ear sensory epitiielia in wild-type and Mathl^-^'"^-^^' mice. Wild-type mice epitiielia are 
shown in Figures 2A, 2C, and 2E and null mouse epitiielia in Figures 2B, 2D, and 2F. The 
organ of Corti of ttie cochlea are shown and indicated in Figures 2 A and 2B. In tiie wild-type 
mouse ttiere are tiu-ee rows of outer hair cells (1, 2, 3), one row of inner hair cells (I), all witii 
hair bundles (HB). The tectorial membrane (TM), an accessory strocture of tiie cochlea, can 
be observed at tiie bottom. Above the sensory epitiielium are squamous cells (SQ) witii 
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radimentary kinociUa (RK). In null mice (Figure 2B), there are only squamous cells. Crista 
ampullaries of a vertical semicircular canal are depicted in Figures 2C and 2D. The null 
mouse crista is similar to the wild-type in overall shape, including the septum (eminentia) 
cruciatum (EC), but is smaller. The macula of the uticle is the focus of Figure 2E and 2F. 
Again, the macula of the null mouse is smaller than the wild-type. Scale bars are as follows: 
10 nm in Figures 2 A and 2B, 50 ^m in Figures 2C and 2D, and 100 ^m in Figures 2E and 2F. 

[0046] Figures 3A through 3F are light micrographs of semi-thin transverse 
sections of inner ear sensory epithelia in wild-type mice (Figures 3A, 3C, and 3E) and 
Mathl^'°^^^ (Figures 3B, 3D, and 3F), all mice were observed at El 8.5. As observed in 
the cochlea of wild-type mice. Figure 3 A, three outer hair cells (1, 2, 3) and one inner (I) hair 
cell are present. Conversely, the null mouse cochlea (Figure 3B) has only squamous cells 
(SQ) in the same region. Hair cells (HC) and supporting cells (SC) are present in the wild- 
type crista ampuUaris (Figure 3C) and utricular macula (Figure 3E), but only supporting cells 
are present in null mice (Figure 3D and 3F). The crista was cut obliquely, accounting for the 
multiple layers of hair cells in Figure 3C. The otoUthic membrane (OM), an accessory 
structure of the utricle, is present in both wild-type mice (Figure 3E) and null mice (Figure 
3F). Scale bars equal 100 jim in (Figures 3A and 3B); 50 jim in (Figures 3C and 3D); and 25 
Hm in (Figures 3E and 3F). 

[00471 Figures 4A and 4B are transmission electron micrographs of El 8.5 
utricular macula in wild-type and Mathl^'^^^ mice. Figure 4A shows that the hair cells 
(HC) and supporting cells (SC) are present in wild-type utricular macula. By contrast, only 
supporting cells are present in the null mouse (Figure 4B). Hair cells have hair bundles (HB) 
and supporting cells have miicrovilli (MV). Hair cells are less electron-dense and have more 
apical nuclei than supporting cells, but only the latter have secretory granules (SG). Some 
immature hair cells (IM) are evident in the wild-type, but not in the null mouse. The scale 
bar in all the figures equals 10|im. 

[0048] Figures 5A through 5F show the Calretinin staining pattern of inner ear 
sensory epithelia. Sections through the utricle of E16.5 wild-type (Figures A5 and 5C) and 
Mathl (Figure 5B and 5D) littermates were counterstained with propidium iodide 

(red) for confocal microscopy. Sections were cut through the crista ampuUaris of E18.5 wild- 
type (Figure 5E) and Mathl ^^^"^ (Figure 5F) were counterstained with DAPI (blue) for 
immunofluorescent microscopy. The crista is cut at an oblique angle, which accounts for the 
multiple layers of hair cells in (Figure 5E). Inununostaining of Cahetinin (green, arrows) is 
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evident in hair cells of wild-type (Figures 5A, 5C, and 5E) but not null mice (Figures 5B, 5D, 
and 5F). Boxed areas in Figures 5A and 53 indicate the regions magnified in Figures 5C and 
5D. Scale bar equals 100 \im in (Figure 5A and 5B), 15 \im in (Figures 5C and 5D) and an 
original magnification of x200 in (Figures 5E and 5F). 

[0049] Figure 6A and 6B show the expression pattem of Afathl in mouse articular 
cartilage using the Mathf'^'^^^ heterozygote. Figure 6A shows the staining pattem of a P 14 
mouse forelimb and demonstrates expression in all joints. Figure 6B is a magnification 
(20X) of an elbow jomt from the same mouse tiiat demonstrates that Mathl is expressed 
exclusively in the non-ossified articular chondrocytes. 

[0050] Figure 7A through 7C show replacement of Mathl coding region by lacZ 
gene. Figure 7A, Top, has a map of the Mathl genomic locus. The coding region is shown 
as a black box. The sites of the probes used to detect the wild-type and mutant alleles are 
shown as black bars. The targeting vector is in the middle with the sites for homologous 
recombination indicated by larger Xs. In the targeted locus shown at the bottom, lacZ is 
translated under the control of Mathl regulatory elements. Figure 7B demonstrates Southern 
blot analysis of embryonic stem cells using the 3* external probe. The upper band represents 
wt allele and the lower band the targeted mutant allele (mut) in targeted clones. Figure 7C 
demonstrates Southern blot analysis of DNA from the progeny of heterozygous mice 
demonstrating the presence of the targeted allele and absence of the wild-type allele in Mathl 
p-gavp-gai jj^jjjg (asterisks). The abbreviations are as follows: (A) Apal; (H) HindlH; (RI) 
EcoRI; (S) Sail; and (X) Xbal. 

[0051] Figures 8A through 8H show MathlllacZ expression and cerebellar 
phenotype in Matht'^^ and Mathl^'^^^'^^ mice. Figure 8A shows Mathl/lacZ expression 
in the dorsal neural tube at E9.5 and (Figure 8B) E10.5. Figure 8C indicates a section 
through the hind brain at El 0.5 has Mathl/lacZ expression in the dorsal portion (arrows). 
Figure 8D demonstrates that in a spinal cord section from E12.5 embryo, dorsal cells migrate 
ventrally (arrows). Figure 8E shows at E14.5 expression is observed in the EGL progenitors 
at the rhombic lip and in migrating cells that will populate the EGL. Figure 8F demonstrates 
in Mathl^^^^^ mice, Mathl/lacZ expression is limited to a few cells in the rhombic lip, 
which is significantly reduced in size. Figure 8G shows that at PO Mathl/lacZ is expressed in 
the EGL. Figure 8H demonstrates that the EGL is absent in the null mice. Original 
magnification for Figures 8C through 8H waslOOx. 
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[0052] Figure 9A through 9G shows expression of Mathl/lacZ in the inner ear 
and brain stem and histological analysis of ventral pontine nucleus. X-gal staining of El 8.5 
Mathl*^ utricular crista (Figure 9A) and inner ear sensory epithelia of Mathl*'^ (Figure 
9B) and A/afA;^*"*^*"' (Figure 9C). The Mathl/lacZ expression in the upper hair cell layer of 
the sensory epithelia of (Figures 9A and 9B) and the characteristic calyx appearance 
(anowhead). In the null mice X-gal staining of epithelial cells is non-specific in the absence 
of hair cells (Figure 9C). Whole-brain X-gal staining of Mathl*'^-^* (Figure 9D) and 
Math! P-s'^P-s'' (Figure 9E) at El 8.5 is demonstrated. There is positive staining of the pontine 
nucleus (arrowhead) and cerebellum (arrow) in Matht'^ mice, which is lacking or greatly 
reduced in null mutants in both the cerebellum, and pontine nucleus (inset). Figures 9F and 
9G show haematoxylin and eosin staining of sagittal sections through the pons of a wild type 
and null mutant (Figures 9F and 9G, respectively), showing the loss of the ventral pontine 
nucleus in null mutants. The original magnifications were as follows: (A) 400x (B & C) 
lOOOx. (D & E) 8x, inset in D & E lOOx, (F & G) lOx. 

[0053] Figures lOA through IDE demonstrate Mathl/lacZ is expressed in joint 
chondrocytes. X-gal staining of whole embryos at (Figure IDA) E12.5 and (Figure lOB) 
El 6.5 illustrates that Mathl/lacZ is expressed in all joints (Figure IOC). Horizontal section 
through the elbow joint of El 8.5 Matht'^'^^ mouse shows that it is expressed in resting 
chondrocytes (arrow). Figure lOD shows a horizontal section through a humero-radial joint 
at PIG that has expression in the articular chondrocytes (arrowhead) and resting chondrocytes 
(arrow). Figure lOE shows high magnification of a section through a wrist joint indicating 
Mathl/lacZ is expressed in articular chondrocytes. The original magnification is as follows: 
(C) lOx; (D) 20x; and (E) 40x. 

[0054] Figures 1 1 A through IIL show Mathl/lacZ expression in Mericel cells. To 
identify the structures stained on the hairy and non-hairy skin. El 6.5 littermate embryos were 
stained as whole mounts, sectioned, and microscopically examined. Shown are sections 
through the vibrissae (Figure 11 A), foot pad at low (Figure IIB) and high magnification of 
the region marked by an arrow in B (Figure IOC), and hairy skin (Figure IID). In all 
sections the localization of the stained cells was as expected fi-om Merkel cells. To look for 
macroscopic defects in null mice, close-up pictures were taken through a stereomicroscope of 
Mathl*'^^ (control, panels E-H) and Mathl ^^^-^ (null, panels I-L) littermate mice. 
Staining in null mice appeared stronger because of a dosage effect in the vibrissae (E, I), limb 
joints (F, J), and foot pads (G, K). In contrast, the staining intensity of null (J, L) mice was 
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markedly weaker than that of heterozygous (F, H) mice in the touch domes associated with 
the hairy skin. The original magnification was follows: A x200; B x50; C x400; D x500; E- 
G-H-I-K-Lx32;F-Jxl6. 

[00551 Figures 12A through 12E show lack of lacZ-stained touch domes in Tabby 
mice. Tabby/Tabby females were crossed with Matht'^^ males, and their progeny were X- 
gal stamed and gender-determined at E16.5. Staining around primary vibrissae in the snout 
was detected in both female embryos heterozygous for the Tabby mutation (Figure 12A) and 
male embryos hemizygous for the mutation (Figure 12B). Secondary vibrissae, which are 
known to vary in number in the Tabby mutants (black arrows), were also stained. The 
staming of the touch domes was less intense in the Tabby/X female (Figure 12D) than 
Matht'^'^^ (wt for Tabby) embryos (Figure 12C), since Tabby is a semidominant mutation. 
However, patches of stained touch domes were detected in a female embryo that carried a 
wild-type allele at the Tabby locus (Figure 12A, red arrow, and 12D). In contrast, a 
hemizygous male completely lacked both staining and touch domes, due to the loss of hair 
follicles that abolishes the development of Merkel cells (Figures 12B and 12E). 

[0056] Figures 13 A through 13F demonstrate marker analysis of Meikel cells in 
wild type and Mathl null mice. Skin sections from Mathl*'* and Mathl ^-^^^ reacted with 
antibodies against MATHl (Figures 13A and 13B), cytokeratin 18 (Figures 13C and 13D), 
and chromogranin A (Figures 13E and 13F). Polyclonal antibodies to MATHl identify 
multiple basal nuclei in rare abdominal hair follicles of wild type (Figure 13 A) but not mutant 
mice (Figure 13B). Monoclonal antibodies to cytokeratin 18 and chromogranin A identify 
Merkel cells in both wild type (Figures 13C and 13E) and mutant (Figures 13D and 13F) 
mice. The original magnification was lOOX. 

[0057] Figures 14A through 14G show Mathl rescues the lack of chordotonal 
neurons in Drosophila ato mutant embryos. Figure 14A shows a dorsal view of the thorax of 
a wild-type fly. Note there are regular array of bristles or macrochaetae. Figure 14B shows a 
similar view of a transgenic fly in which Mathl was overexpressed using the UAS/GAL4 
system (Brand and Perrimon, 1993). This ectopic expression leads to numerous extra bristles 
that are extemal sensory organs (another type of mechano receptor), not CHOs. Ectopic 
CHOs were produced in many other regions. Figure 14C shows a lateral view of two 
abdominal clusters containing 6 CHOs in addition to extemal sensory organs, revealed by a 
neuronal-specific antibody (Mab 22C10). The 5 lateral CHOs form a cluster, and the sixth is 
dorsal to the cluster. Figure 14D shows a similar view of an ato mutant embryo showing lack 
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of the CHOs. Figure 14E demonstrates ubiquitous expression of Math 1 induces new CHO 
neurons in ato mutant embryos in the proper location. Figure 14F shows in situ hybridization 
of whole mount third instar brain using the ato cDNA as a probe. Note expression in the 
developing optic lobes ("horse shoe" expression patterns) and two punctate clusters of cells in 
the middle of the brain lobes (arrow heads). Figure 14G shows Afathl expression in 
Drosophila induces CHO formation in normal and ectopic locations. The (+) indicates 
presence of CHOs and (-) indicates their absence. Number of (+) in the first column is used 
to quantify the relative increase in the number of CHOs observed when Mathl is expressed. 

[0058] Figure 15 shows Mathl ILacZ expression detected by X-gal staining. 
Mathl ILacZ expression in El 8.5 intestines (A to D). Cross section of Mathl^'°^^* ileum (A) 
and colon (C), Mathl ileum (B)and colon (D); Arrows indicate sparse lacZ-positive 
cells in heterozygous animals. MathllLacZ expression in 5-month-old Mathl mice (E 
andF). Longitudinal section of jejunum (E) and colon (F). Original magnification, X400. 

[00591 Figure 16 illustrates Mathl /lacZ expression detected by X-Gal staining. 
Longitudinal section of duodenum (A) and ileum (B) from five-month old Mathl^'^"^* mice. 
Original magnification 400x. 

[00601 Figure 1 7 demonstrates a lack of goblet and enteroendocrine cells in E18.5 
intestines. H&E staining reveals several goblet cells in wild-type duodenum [arrowheads in 
(A)l and non in null mutant (B); Alcian blue staining shows positively stained goblet cells in 
wild-type ileum [(arrowheads in (C)] but none in Mathl null ileum (D). Serotonin- 
positive enteroendocrine cells [red-stained cells in (E), the arrow points to the cell enlarged in 
the inset] are evident in wild-type (E) but not Mathl^^' (F) jejunum. Original 
magnification, X200. 

[0061] Figure 18 shows electron microscopy, cryptdin RT-PCR and 
colocalization of MathllLacZ and proliferation marker Ki-67.(A) EM of the ileum reveals 
goblet cells (G)and enteroendocrine cells (E) in wild-type mice; neither of these secretory 
cells is formed in the Mathl null mice (B ). Enterocytes [arrowheads in (A)and (B)] appear 
normal in Mathl null mice (B). Cryptdin mRNA (C) was detected in wild-type duodenum, 
jejunum, and ileum, but not in colon, whereas the Mathl null mutant lacked cryptdin RNA in 
all intestinal tissues examined. G6PDH mRNA level was used as a control. X-gal and Ki-67 
antibody staining (blue cytoplasmic and red nuclear, respectively) in sections from adult 
duodenum (D) and ileum (E) (no hematoxylin counterstaining was applied). Paneth cells 
with ^ical granules, located at the bottom of crypts (arrow), show no Ki-67 staining. A 
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subset of Ki-67-positive cells are also Mathl/LacZ -positive (arrowheads).Original 
magnification, X2500 (A and B), XIOOO (D and E). 

[0062] Figure 19 shows alkaline phosphatase and lactase activity of enterocytes in 
El 8.5 intestine. The dark purple staining of the enterocyte brush border indicates that 
alkaline phosphatase activity (arrows) is similar in wild-type (A) and Mathl null (B) 
intestines. Arrowheads in the lower panels indicate that lactase activity (the dark blue line) is 
also similar in wild type (C) and null (D) intestines. The arrow in (D) indicates clustered b- 
galactosidase activity in the intervillus region of Mathl null intestine. Original magnification 
400x. 

[0063] Figure 20 demonstrates colocalization of Mathl/LacZ and proliferation 
marker Ki-67. X-Gal and Ki-67 immunostaining in hematoxylin counter-stained sections 
fi-om duodenum (A) and ileum (B) fi-om a five-month-old Mathl^'^''^^ mouse. Several cells 
in the crypts are Ki-67-positive (brown nuclear staining); a subset of these cells also express 
Mathl/lacZ (blue cytoplasmic staining, arrows). Insets provide enlarged views of the boxed 
areas. Original magnification 400x. 

[0064] Figure 21 shows expression of Notch components and model for epithelial 
cell lineage differentiation in mouse intestine. (A) El 8.5 small intestines were subjected to 
RT-PCR using primers specific to the indicated genes. G6PDH mRNA level served as a 
control. (B) Mathl is essential for secretory cells. Abbreviations: Sec, secretin; L, 
glucagons/peptide- YY; CCK, cholecystokinin;- SP, substance P;- 5HT, serotonin; Som, 
somatostatin; GIP, gastric inhibitory peptide; Gas, gastrin. 

[0065] Figure 22 illustrates Hes-1 inununohistochemistry of El 8.5 intestine. 
Jejunum sections firom El 8.5 embryos were subjected to Hes-1 antibody staining (no 
hematoxylin counterstaining was applied). The enterocytes are positive for Hes-1 (red 
nuclear staining, indicated by arrowheads) in wild type (A) and Mathl null (B) mice. The 
dark apical patches [arrow in (A)] represent nonspecific staining of the secretory products of 
the goblet cells. Original magnification 400x. 

DETAILED DESCRIPTION OF THE INVENTION 

[0066] It is readily apparent to one skilled in the art that various embodiments and 
modifications can be made to the invention disclosed m this Application without departing 
firom the scope and spirit of the invention. 
I. Definitions 
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[0067] The term "a" or "an" as used herein in the specification may mean one or 
more. As used herein in the claim(s), when used in conjunction with the word "comprising", 
the words "a" or "an" may mean one or more than one. As used herein "another" may mean 
at least a second or more. 

[0068] The term "abnormal proliferation" as used herein is defined as any 
proUferation of any type of cell, wherein said cell is not under the constraints of normal cell 
cycle progression and wherein said proUferation can result in a tumor or any cancerous 
development. 

[0069] The term "alteration" as used herein is defined as any type of change or 
modification to a nucleic acid or amino acid. Said change or modification includes any 
mutation, deletion, rearrangement, addition to a nucleic acid. This includes 
posttranscriptional processing such as addition of a 5 cap, intron processing and 
polyadenylation. Mutations can be nonsense, missense, fi-ameshift, or could lead to a 
truncated amino acid or could alter the conformation of the amino acid. The alteration to a 
nucleic acid can be present in regulatory sequences or can affect trans-acting factors. Also, 
multiple alterations can be present. Said change or modification also includes any change to 
an amino acid including methylation, myristilation, acetylation, glycosylation, or a change to 
signals associated with processing of said amino acid including intracellular or intercellular 
localization signals and cleaving of extraneous amino acids. Said alteration can also affect 

- - - degradation or folding of said protein . _ 

[0070] The term "flto«a/-associated" as used herein is defined as any nucleic acid 
sequence or amino acid sequence which is the Drosophila atonal nucleic acid sequence or 
amino acid sequence, or is any sequence which is homologous to or has significant sequence 
similarity to said nucleic acid or amino acid sequence, respectively. The sequence can be 
present in any animal including mammals and insects. As used herein, significant sequence 
similarity means similarity is greater than 25% and can occur in any region of another 
sequence. Examples of a/o/iaZ-associated include but are not limited to Mathl (mouse atonal 
homolog 1), CathJ (chicken atonal homolog 1), HathJ (human atonal homolog 1), and Xathl 
(Xenopus atonal homolog 1). Furthermore, multiple homologous or similar sequences can 
exist in an animal. 

[0071] The terai "defect" as used herein is defined as an alteration, mutation, flaw 
or loss of expression of an fl^o/iaZ-associated sequence. A skilled artisan is aware that loss of 
expression concerns expression levels of an a/o/ifl/-associated sequence which are not 
significant or detectable by standard means in the art. A skilled artisan is also aware that 
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loss, or absence, of expression levels in an adult organism, such as a human, occurs naturally 
and leads to impairment of hearing over time. Thus, "defect" as used herein includes the 
natural reduction or loss of expression of an atonaZ-associated sequence. 

[0072] The term "delivering" as used herein is defined as bringing to a destination 
and includes administering, as for a therapeutic purpose. 

[0073] The term "delivery vehicle" as used herein is defined as an entity which is 
associated with transfer of another entity. Said delivery vehicle is selected fi-om the group 
consisting of an adenoviral vector, a retroviral vector, an adeno-associated vector, a plasmid, 
a liposome, a nucleic acid, a peptide, a lipid, a carbohydrate and a combination thereof. 

[0074] The term "detectable condition" as used herein is defined as any state of 
health or status of an animal, organ or tissue characterized by specific developmental or 
pathological symptoms. Examples include but are not limited to loss of hair cells, cerebellar 
granule neuron deficiencies, hearing impairment, imbalance, joint disease, osteoarthritis, 
abnoraial proliferation of neoplastic neuroectodermal cells and formation of 
meduUoblastoma. 

[0075] The term "downregulated expression" as used herein is defined as the 
expression of an atowaZ-associated sequence in approximately less than wild type quantities 
in a cell, including substantially lacking any expression, in which the ato/ia/-associated 
sequence is not naturally found in the cell. 

[0076] The term "gastrointestinal condition" as used herein is defined as a 

condition or disease which affects at least one aspect of the gastrointestinal system of an 
individual, including the small intestine and large intestine (colon). The gastrointestinal 
condition may be the direct result of a disease, for example, although it may be an indirect 
resuh of a disease. Examples include damage to the intestine as a result of a disease or 
infection. Examples of gastrointestinal diseases include inflammatory bowel disease, irritable 
bowel syndrome, necrotizing entercolitis, cancer, and pathogenic infection. 

[0077] The term "heterologous" as used herein is defined as nucleic acid sequence 
which is of or relating to nucleic acid sequence not naturally occurring in a particular locus. 
In an alternative embodiment, the heterologous nucleic acid sequence naturally occurs in a 
particular locus, but contains a molecular alteration compared to the naturally occurring 
locus. For instance, a wild-type locus of an atonaZ-associated sequence can be used to 
replace a defective copy of the same sequence. 

[0078] The term "inactivated" as used herein is defined as a state in which 
expression of a nucleic acid sequence is reduced or completely eliminated. Said inactivation 
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can occur by transfer or insertion of another nucleic acid sequence or by any means standard 
in the art to affect expression levels of a nucleic acid sequence. 

[0079] The term "precursors" as used herein is defined as progenitor cells from 
which other cells derive their origin and/or properties. 

[0080] The term "regulatable reporter sequence" as used herein is defined as any 
sequence which directs transcription of another sequence and which itself is under regulatory 
control by an extrinsic factor or state. Examples of extrinsic factors or states include but are 
not Umited to exposure to chemicals, nucleic acids, proteins, peptides, lipids, carbohydrates, 
sugars, light, sound, hormones, touch, or tissue-specific milieu. Examples of regulatable 
reporter sequences include the GAL promoter sequence and the tetracycline 
promoter/transactivator sequence. 

[0081] The term "regulatory sequence" as used herein is defined as any sequence 
which controls either directly or indirectly the transcription of another sequence. Said control 
can be either regarding the initiation or cessation of transcription or regarding quantity or 
tissue distribution of transcription. 

[0082] The term "reporter sequence" as used herein is defined as any sequence 
which demonstrates expression by a regulatory sequence. Said reporter sequence can be used 
as a marker in the form of an RNA or in a protein. Examples of reporter sequences are b- 
galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), neomycin, 
kananiycin, luciferase, b-glucuronidase and chloramphenicol transferase (CAT). In a specific 
aspect of the present invention, the presence and quantity of the reporter sequence product, 
whether it be a nucleic acid or amino acid, reflects the level of transcription by the promoter 
sequence which regulates it. 

[0083] The term "therapeutically effective" as used herein is defined as the 
amount of a compound required to improve some symptom associated with a disease. For 
example, in the treatment of hearing impairment, a compound which improves hearing to any 
degree or arrests any symptom of hearing impairment would be therapeutically effective. In 
the treatment of a joint disease, a compound which improves the health or movement of a 
joint to any degree or arrests any symptom of a joint disease would be therapeutically 
effective. In the treatment of abnormal proliferation of cells, a compound which reduces the 
proliferation would be therapeutically effective. In the treatment of cancer, a compound 
which reduces proliferation of the cells, reduces tumor size, reduces metastases, reduces 
proliferation of blood vessels to said cancer, facilitates an immune response against the 
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cancer would be therapeutically effective, for example. A therapeutically effective amount of 
a compound is not required to cure a disease but will provide a treatment for a disease. 

[0084] The term "upregulated expression" as used herein is defined as the 
expression of an ato/«i/-associated sequence in approximately wild type or greater than wild 
type quantities in a cell in which the a/o/iaZ-associated sequence is naturally found. 

[00851 The term "vector" as used herein is defined as a biological vehicle for 
delivery of a specific entity. In a specific embodiment the entity is an a/ona/-associated 
nucleic acid. 

n. The Present Invention 

[00861 The mouse small intestinal epithelium consists of four principal cell types 
deriving fi-om one multipotent stem cell: enterocytes, goblet, enteroendocrine, and Paneth 
cells. Previous studies showed that Mathl, a basic helix-loop-helix (bHLH) transcription 
factor, is expressed in the gut. However, the present invention demonstrates that, although 
Mathl is involved in governing differentiation in neuronal cells, it was surprising that Mathl 
has no detectable levels in the gut nervous system. The present mvention demonstrates that 
loss of Mathl leads to depletion of goblet cells (which secrete mucous important for food 
movement), enteroendocrine cells (which secrete regulatory peptides), and Paneth cells 
(which secrete microbe-fighting peptides) without affecting enterocytes. Colocalization of 
Mathl with Ki-67 in some proliferating cells suggests that secretory cells (goblet, 

— ^enteroendocrinerand Paneth cells)-arise fi:om-a-common progenitor that expresses Mathl, 

whereas absorptive cells (enterocytes, which absorb nutrients) arise firom a progenitor that is 
A/a/A7-independent. The continuous, rapid renewal of these cells makes the intestinal 
epithelium a model system for the study of stem cell regeneration and lineage conunitment. 
Furthermore, the MATHl protein regulates the Delta-Notch signaling pathway that governs 
endocrine cell differentiation. Clearly, a basic understanding of the intestinal cell 
differentiation will provide new treatments for such diseases as irritable bowel syndrome and 
other abnormalities in motility of the gut. Also, since these intestinal cells depend on these 
regulatory pathways to signal them to stop proliferating, these pathways are useful for 
addressing the mechanisms of colon cancers. Furthermore, understanding the regulatory 
control of intestinal stem cells could lead to treatments to regenerate damaged intestinal 
tissue. In a specific embodiment, a donnant stem cell is provided to an individual with at 
least one regulatory factor to encourage differentiation and replace cells lost to injury. In a 
specific embodiment, the regulatory factor for enhancing differentiation of an intestinal stem 
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cell to a secretory intestinal cell is a bone morphogenetic protein (BMP), such as GDF7, a 
BMP known to induce Mathl in spinal cord. In another specific embodiment, the regulatory 
factor for enhancing differentiation of an intestinal stem cell to an absorptive intestinal cell is 
a member of the hairy/enhancer of split (HES) family. In a specific embodiment, homologs 
of Has family members downregulate a/o/iaZ-related sequences. Examples of Hes family 
members include Hesl, Hes2, Hes3, Hes4, Hes5, Hes6, Hes7, and Hes/ESpl related proteins 
(HERPl and HERP2) (which are upregulated by Notch and can heterodimerize with Hes 
members to repress bHLH gene expression). 

[0087] In a specific embodiment, similar methods and compositions are utilized 
for other types of stem cells. 

[0088] In one aspect of the present invention there are methods and reagents 
which include utilization of an atonaZ-associated nucleic acid or amino acid sequence for the 
therapeutic use of a gastrointestinal. Thus, any homolog or ortholog of atonal (from 
Drosophila) including but not limited to Cathl (from chicken), Hathl (from human), Mathl 
(fi-om mice) or Xathl (firom Xenopus) can be used in the present invention. In a preferred 
embodiment these sequences are directed to treatment of an animal, specifically a human, for 
a gastrointestinal condition. It is within the scope of the invention to encompass any 
sequence which is homologous to or has significant sequence similarity to said nucleic acid 
or amino acid sequence, respectively. The sequence can be present in any animal including 
mammals and insects. - As used herein, significant sequence, similarity- means similarity 
(identity of amino acid residues or nucleic acid bases) is greater than 25% and can occur in 
any region of the sequence. In another embodiment an a^o/iaZ-associated sequence as used 
herein has greater than about 50% sequence similarity, greater than about 70% similarity, or 
greater than about 80% similarity. 

[0089] It is within the scope of the present invention that an Ato/uiZ-associated 
nucleic acid sequence or amino acid sequence is utilized wherein domains important for 
activity, such as the basic HLH region, are included in a molecule but further comprise 
alterations, mutations, deletions or substitutions in regions of the nucleic acid or amino acid 
sequence which are not part of a domain important for an activity and do not affect its 
function. 

[0090] Examples of a/onaZ-associated sequences include but are not limited to 
Mathl (mouse atonal homolog 1), Cathl (chicken atonal homolog 1), Hathl (human atonal 
homolog I), and Xathl (Xenopus atonal homolog 1). Such examples are represented in SEQ 
ID N0:1 through SEQ ID NO:66, although others very likely exist in related organisms. A 
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skilled artisan is cognizant of means to identify such sequences which have significant 
similarity, such as searching database collections of nucleic and amino acid sequence located 
on the World Wide Web, including at the site for the National Center for Biotechnology 
Information's GenBank database. 

[0091] The sequences provided herein and the corresponding GenBank Accession 
numbers are listed parenthetically as follows: SEQ ID N0:1 (NM_005172); SEQ ID N0:2 
(NPJ05163.1 ); SEQ ID N0:3 (AW413228); SEQ ID NO: 4 (NM_009719); SEQ ID NO:5 
(NP_033849.1); SEQ ID N0:6 (NM_009718); SEQ ID NO: 7 (NP_033848.1) SEQ ID NO:8 
(NM_009717); SEQ ID NO: 9 (NP_033847.1); SEQ ID NO:10 (NM_007500); SEQ ID NO: 
11(NP_031526.1); SEQ ID N0:12 (NM_007501 ); SEQ ID N0:13 (AW280518); SEQ ID 
NO:14(AW236965 ); SEQ ID NO: 15(AW1 63683); SEQ ID N0:16 (AF134869); SEQ ID 
NO: 17(AAD31451.1); SEQ ID N0:18 (AJ012660); SEQ ID N0:19 (CAA10106.1); SEQ ID 
NO:20 (AJ012659); SEQ ID N0:21 (CAA10105.1); SEQ ID N0:22 (AF071223); SEQ ID 
NO:23 (AAC68868.1); SEQ ID N0:24 (U76208); SEQ ID NO:25 (AAC53029.1); SEQ ED 
NO:26 (U76210); SEQ ID NO:27 (AAC53033.1); SEQ ID NO:28 (U76209); SEQ ID NO:29 
(AAC53032.1); SEQ ID NO:30 (U76207); SEQ ID N0:31 (AAC53028.1); SEQ ID NO:32 
(AF036257); SEQ ID NO:33 (AAC15969.1); SEQ ID NO:34 (AF034778); SEQ ID N0:35 
(AJ001178); SEQ ID NO:36 (CAA04572.1); SEQ ID NO:37 (Y07621); SEQ ID NO:38 
(CAA68900.1); SEQ ID NO:39 (AF024536); SEQ ID NO:40 (AAB82272.1); SEQ ID 
N0:41 (D85188); SEQ ID NO:42 (BAA12738.1); SEQ ID NO:43 (D44480); SEQ ID 
NO:44(BAA07923.1); SEQ ID NO:45 (D43694); SEQ ID NO:46 (BAA07791.1); SEQ ID 
NO:47 P85845); SEQ ID NO:48 (BAA12880.1); SEQ ID NO:49 (U93171); SEQ ID NO:50 
(AAB58669.1); SEQ ID N0:51 (U93170); SEQ ID NO:52 (AAB58668.1); SEQ ID NO:53 
(U61152); SEQ ID NO:54 (AAB41307.1); SEQ ID NO:55 (U61151); SEQ ID NO:56 
(AAB41306.1); SEQ ID NO:57 (U61148); SEQ ID NO:58 (AAB41305.1); SEQ ID NO:59 
(U61149); SEQ ID NO:60 (AAB41304.1); SEQ ID N0:61 (U61150); SEQ ID NO:62 
(AAB41303.1); SEQ ID N0:63 (L36646); and SEQ ID NO:64 (AAA21 879.1). 

[0092] In an aspect of the invention there is an animal having a heterologous 
nucleic acid sequence replacing an allele of an a/o/ia/-associated nucleic acid sequence under 
conditions wherein said heterologous sequence inactivates said allele. In an alternative 
embodiment a heterologous sequence is delivered to a cell for extrachromosomal 
propagation. In another alternative embodiment a heterologous sequence is integrated into 
the chromosome of a cell in a locus other than the locus of an ato/ia/-associated nucleic acid 
sequence. In a preferred embodiment said heterologous sequence is expressed under control 
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of an atowaZ-associated regulatory sequence. In a specific embodiment both atonal- 
associated alleles are replaced. In an additional specific embodiment both atonaZ-associated 
alleles are replaced with nonidentical heterologous nucleic acid sequences. Methods to 
generate transgenic animals are well known in the art, and a skilled artisan would refer to 
such references as Transgenic Animals by Grosveld and Kollias (eds.) or Mouse Genetics and 
Transgenics : A Practical Approach by Jackson et al (eds.). 

[0093] In another embodiment of the present invention is a method for screening 
for a compound in an animal, wherein said compound affects expression of an atonal- 
associated nucleic acid sequence comprising delivering said compound to said animal 
wherem said animal has at least one allele of an atowaZ-associated nucleic acid sequence 
inactivated by insertion of a heterologous nucleic acid sequence wherein said heterologous 
nucleic acid sequence is under control of an aro/ia/-associated regulatory sequence, and 
monitoring for a change in said expression of said a/onaZ-associated nucleic acid sequence. 
Examples of regulatory sequences can include promoter sequences, enhancers or silencers. 

[0094] In specific embodiments there is a compound which upregulates or 
downregulates said expression of an atowaZ-associated nucleic acid sequence. The 
upregulation or downregulation can be by increasing the rate of transcription or decreasing 
the rate of mRNA decay. 

[00951 Another embodiment of the present invention is a compound which affects 
expression of an a/o/z^iZ-associated nucleic acid sequence. In specific embodiments said 
compound upregulates or downregulates expression of an atonal-^ssoci^XQd nucleic acid 
sequence. In a specific embodiment said compound affects a gastrointestinal condition. 

[0096] Another embodiment of the present invention is a method for screening for 
a compound in an animal, wherein the compound affects a detectable condition in the animal, 
comprising delivering the compound to the animal wherein at least one allele of an atonal- 
associated nucleic acid sequence in said animal is inactivated by insertion of a heterologous 
nucleic acid sequence, wherein said heterologous nucleic acid sequence is under the control 
of an ato/iaZ-associated regulatory sequence, and monitoring said animal for a change in the 
detectable condition. In another embodiment said delivery of said compound affects 
expression of said heterologous nucleic acid sequence. In specific embodiments said 
expression of said heterologous nucleic acid sequence is upregulated or downregulated. In 
additional specific embodiments the animal is a mouse, Drosophila, firog, zebrafish, rat, 
hamster and guinea pig. 
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[0097] Another embodiment of the present invention is a compound wherein said 
compound affects a gastrointestinal condition in a transgenic animal of the present invention. 
In specific embodiments said compound affects expression of a heterologous nucleic acid 
sequence. In additional specific embodiments said compound upregulates or downregulates 
expression of a heterologous nucleic acid sequence. 

[0098] In other embodiments of the present invention are methods of treating an 
animal, including a human, for a gastrointestinal condition. Said methods include 
administering a therapeutically effective amount of an afonoZ-associated nucleic acid or 
amino acid sequence. In specific embodiments said administration is by a vector selected 
from the group consisting of a viral vector (including bactieriophage, animal and plant 
viruses), a plasmid, cosmid or any other nucleic acid based vector, a liposome, a nucleic acid, 
a peptide, a lipid, a carbohydrate and a combination thereof of said vectors. In a specific 
embodiment said viral vector is an adenovirus vector, a retrovirus vector, or an adeno- 
associated vector, including a lentivirus vector, Herpes virus vector, alpha virus vector, etc. 
Thus, a vector can be viral or non-viral. In another specific embodiment said vector is a cell. 
In a preferred embodiment said vector is an adenovirus vector comprising a cytomegalovirus 
IE promoter sequence and a SV40 early polyadenylation signal sequence. In another specific 
embodiment said cell is a hiunan cell. 

[0099] In an embodiment of the present invention there is provided a method for 
treating an organism for a disease that is a result of loss of functional flto«a/-associated 
nucleic acid or amino acid sequence. A skilled artisan is aware that this loss can be due to 
natural reduction or absence of significant (or to detectable levels) expression which occurs 
in an adult human. 

[0100] In a specific embodiment, the present invention also provides a method of 
treating an animal in need of treatment for a deficiency in the intestine. This method 
comprises delivering a transcription factor having an amino acid with at least about 70% 
identity, preferably at least about 80% identity, and more preferably at least about 90% 
identity to the sequence AANARERRRMHGLNHAFDQLR to a cell in the animal. In some 
embodiments, the cell in the animal is located in the inner ear of the animal. Preferably, the 
transcription factor competes with atonal for binding to Daughterless protein (Jarman et al, 
1993) or competes for binding with Math-1 to E47 protein (Akazawa et al., 1995). 

[0101] In a preferred embodiment of the present invention there are compositions 
to treat an organism for various medical conditions, discussed herein, comprising an atonal- 
associated nucleic acid sequence or amino acid sequence in combination with a delivery 
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vehicle, wherein said organism comprises a defect in an a/ona/-associated nucleic acid 
sequence. A skilled artisan is aware that an adult organism, such as an adult human, naturally 
does not express atonal to significant or detectable levels, but instead expresses atonal in an 
embryonic stage of development (see the Examples). Thus, in a preferred embodiment, 
compositions to treat an organism as discussed herein, include compositions to treat 
organisms who do not contain a mutation in an atonal nucleic acid or amino acid sequence 
but who naturally have atonal no longer expressed to significant or detectable levels. 

[0102] In another embodiment of the present invention is a composition 
comprising an a/onaZ-associated amino acid sequence or nucleic acid sequence in 
combination with a delivery vehicle wherein said vehicle delivers a therapeutically effective 
amount of an atonaZ-associated nucleic acid sequence or amino acid sequence into a cell. In 
specific embodiments said vehicle is the receptor-binding domain of a bacterial toxin or any 
fiision molecule or is a protein transduction domain. In a specific embodiment said protein 
transduction domain is firom the HIV TAT peptide. 

[0103] In another embodiment of the present invention there is a composition to 
treat an organism for cancer, wherein said organism comprises a defect in an atonal- 
associated nucleic acid sequence. In a specific embodiment the defect is a mutation or 
alteration of said a/ona/-associated nucleic acid sequence. In another specific embodiment 
the defect affects a regulatory sequence of said afoncZ-associated nucleic acid sequence. In 
- an additional embodiment of the present invention there is a composition to treat an organism 
for cancer, wherein said organism comprises defect in a nucleic acid sequence which is 
associated with regulation of an ato/iaZ-associated nucleic acid sequence. In an additional 
embodiment of the present invention there is a composition to treat an organism for cancer, 
wherein said organism comprises a defect in an amino acid sequence which is associated with 
regulation of an a/onaZ-associated nucleic acid sequence. In a specific embodiment said 
cancer is medulloblastoma. 
IIL Gastrointestinal Conditions 

[0104] In specific embodiments, the methods of treatment for a gastrointestinal 
condition described herein are used alone or in conjunction with standard therapies for the 
gastrointestinal condition. 

A. Inflammatory Bowel Disease 

[0105] Inflammatory bowel disease is the name of a group of disorders that cause 
the intestines to become inflamed (red and swollen). The inflanmiation lasts a long time and 
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usually is recurring. Symptoms include abdominal cramps and pain, diarrhea, weight loss 
and/or bleeding from the intestines. Two kinds of inflammatory bowel disease include 
Crohn's disease and ulcerative colitis. Crohn's disease usually causes ulcers (open sores) 
along the length of the small and large intestines. Crohn's disease either does not affect the 
rectum or causes inflanunation or infection with drainage around the rectum. Ulcerative 
colitis usually causes ulcers in the lower part of the large intestine, often starting at the 
rectum. 

10106] Treatment for inflanunatory bowel disease includes removing the 
inflammation by taking anti-inflammatory medicines including sulfasalazine (brand name: 
Azulfidine), olsalazine (brand name: Dipentum) and mesalamine (brand names: Asacol, 
Pentasa, Rowasa). An antibiotic, such as metronidazole (brand name: Flagyl), may be helpful 
for killing germs in the intestines, especially for Crohn's disease. Corticosteroids, such as 
prednisone, often are also prescribed. 

B. Irritable Bowel Syndrome 

[0107] Irritable bowel syndrome (IBS), also referred to as fiinctional bowel 
syndrome, irritable colon, spastic bowel or spastic colon, is a problem with the intestines, 
wherein they squeeze too hard or not hard enough, causing food to move too quickly or too 
slowly through the intestines. Symptoms include bloating and gas, constipation, diarrhea, 
especially after eating or first thing in the morning, feeling an urge to have a bowel 
movement after ahready having had one, feeling a strong urge to have a bowel movement, 
and/or abdominal pain and cramping that may go away after having a bowel movement. 

C. Pathogenic Organisms 

1. Helicobacteriosis 

[0108] Helicobacteriosis refers to infection of the gastrointestinal tract with the 
bacteria, Helicobacter pylori (H. pylori). It is a primary cause of ulcer disease and has 
revolutionized the treatment of peptic ulcer disease. It is also believed to be a cause of 
various cancers of the stomach. 

[0109] H. pylori is a gram-negative spiral shaped organism that contains flagella 
(tail-like structure) and other properties that allow it to survive in the acidic environment of 
the stomach. In addition to flagella, which allow the organism to move around in the liquid 
mucous layer of the stomach, i/. pylori also produces the enzyme "urease" that protects it 
from harm by gastric acid. As the production of this enzyme is relatively unusual, new 
diagnostic tests have enabled rapid identification of the bacteria. K pylori also produces two 
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Other chemicals: the cytotoxin VacA and the protein CagA. Patients with ulcer disease are 
more likely to produce the cytotoxin (VacA). The CagA protein not only occurs frequently in 
ulcer disease but also in cancer. The bacteria is well adapted to survival within the stomach, 
surviving there for years. However, once infection begins, a form of chronic inflammation 
(chronic gastritis) always develops. In most individuals, initial infection causes little or no 
synq)toms, although some individuals experience abdominal pain and nausea. 

(01101 In about 15% of infected persons, ulcer disease develops either in the 
stomach or duodenum. Acid secretion increases in most patients with duodenal ulcers. This 
increase returns to normal once H. pylori is eliminated. It is now known that elimination of 
the bacteria will decrease substantially the risk of recurrent bouts of ulcer disease in the far 
majority (85% or so) of patients. 

[01111 In the last decade it has been shown that H. pylori is not only the prime 
cause of ulcer disease of the stomach and duodenum, but is also strongly associated with 
various tumors of the stomach. Bacterial infection is 9 times more common in patients with 
cancer of the stomach, and 7 times more common in those with lymphoma of the stomach 
(tumor of the lymphatic tissue), called a MALT tumor. It is believed that the prolonged 
inflammation leads to changes in cell growth and tumors. Eliminating H. pylori can lead to 
regression of some tumors. 

[01121 In addition to the above-described damage caused by H. pylori^ some 
individuals lose normal gastric function, such as.the ability to absorb, vitamin B-1 2. . 
2. Giardiasis 

[01131 Giardia is a microscopic parasite that can live in the human bowel, and 
infection by this parasite is called giardiasis. Some symptoms of giardiasis are diarrhea, 
belching, gas and cramps. Giardiasis is easy to contract upon drinking untreated spring water 
or stream water. Many animals carry Giardia in their feces and may introduce this parasite 
into rivers, streams and springs in rural areas. 
IV. Nucleic Acid-Based Expression Systems 

A. Vectors 

[01 14] One of skill in the art would be well equipped to construct a vector through 
standard recombinant techniques, which are described in Sambrook et al., 1989 and Ausubel 
et al., 1994, both incorporated herein by reference. 

[01151 The term "expression vector" refers to a vector containing a nucleic acid 
sequence coding for at least part of a gene product capable of being transcribed. In some 
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cases, RNA molecules are then translated into a protein, polypeptide, or peptide. In other 
cases, these sequences are not translated, for example, in the production of antisense 
molecules or ribozymes. Expression vectors can contain a variety of "control sequences," 
which refer to nucleic acid sequences necessary for the transcription and possibly translation 
of an operably linked coding sequence in a particular host organism. In addition to control 
sequences that govern transcription and translation, vectors and expression vectors can 
contain nucleic acid sequences that serve other functions as well and are described infra. 
1. Promoters and Enhancers 
[0116] A "promoter" is a control sequence that is a region of a nucleic acid 
sequence at which initiation and rate of transcription are controlled. It can contain genetic 
elements at which regulatory proteins and molecules can bind such as RNA polymerase and 
other transcription factors. A promoter may or may not be used in conjunction with an 
"enhancer," which refers to a cis-acting regulatory sequence involved in the transcriptional 
activation of a nucleic acid sequence. 

10117] Naturally, it will be important to employ a promoter and/or enhancer that 
effectively directs the expression of the DNA segment in the cell type, organelle, and 
organism chosen for expression. Those of skill in the art of molecular biology generally 
know the use of promoters, enhancers, and cell type combinations for protein expression, for 
example, see Sambrook et al. (1989), incorporated herein by reference. The promoters 
employed can be constitutive, tissue-specific, inducible, and/or useful under the appropriate 
conditions to direct high level expression of the introduced DNA segment, such as is 
advantageous in the large-scale production of recombinant proteins and/or peptides. The 
promoter can be heterologous or endogenous. 

[0118] The identity of tissue-specific promoters or elements, as well as assays to 
characterize their activity, is well known to those of skill in the art. Examples of such regions 
include the human LIMK2 gene (Nomoto et al. 1999), the somatostatin receptor 2 gene 
(Kraus et al., 1998), murine epididymal retinoic acid-binding gene (Lareyre et al., 1999), 
human CD4 (Zhao-Emonet et al., 1998), mouse alpha2 (XI) collagen (Tsumaki, et al., 1998), 
DIA dopamine receptor gene (Lee, et al, 1997), insulin-like growth factor H (Wu et al, 
1997), human platelet endothelial cell adhesion molecule-1 (Almendro et al., 1996). 

2. Initiation Signals and Internal Ribosome Binding Sites 

10119] A specific initiation signal also can be required for efficient translation of 
coding sequences. These signals include the ATG initiation codon or adjacent sequences. 
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Exogenous translational control signals, including the ATG initiation codon, can need to be 
provided. One of ordinary skill in the art would readily be capable of determining this and 
providing the necessary signals. It is well known that the initiation codon must be "in-frame" 
with the reading frame of the desired coding sequence to ensure translation of the entire 
insert. The exogenous translational control signals and initiation codons can be either natural 
or synthetic. The efficiency of expression can be enhanced by the inclusion of appropriate 
transcription enhancer elements. 

[0120] In certain embodiments of the invention, the use of internal ribosome entry 
sites (IRES) elements are used to create multigene, or polycistronic, messages. IRES 
elements can be linked to heterologous open reading frames. Multiple open reading frames 
can be transcribed together, each separated by an IRES, creating polycistronic messages. By 
virtue of the IRES element, each open reading frame is accessible to ribosomes for efficient 
translation. Multiple genes can be efficiently expressed using a single promoter/enhancer to 
transcribe a single message (see U.S. Patent 5,925,565 and 5,935,819, herein incorporated by 
reference). 

3. Multiple Cloning Sites 

[01211 Vectors can include a muhiple cloning site (MCS), which is a nucleic acid 
region that contains multiple restriction enzyme sites, any of which can be used in 
conjunction with standard recombinant technology to digest the vector. (See Carbonelli et 
dl, 1999, Levensori era/;, 1998, and Cocea, 1997, incorporated herein by reference.) 

4. Splicing Sites 

[01221 Most transcribed eukaryotic RNA molecules will undergo RNA splicing to 
remove introns from the primary transcripts. Vectors containing genomic eukaryotic 
sequences can require donor and/or acceptor splicing sites to ensure proper processing of the 
transcript for protein expression. (See Chandler et al, 1997, herein incorporated by 
reference.) 

5. Polyadeaylation Signals 

[0123] In expression, one will typically include a polyadenylation signal to effect 
proper polyadenylation of the transcript. Specific embodiments include the SV40 
polyadenylation signal and/or the bovine growth hormone polyadenylation signal, convenient 
and/or known to ftmction well in various target cells. 

6. Origins of Replication 



wo 03/047532 



PCT/US02/41458 



[0124] In order to propagate a vector in a host cell, it can contain one or more 
origins of replication sites (often termed "ori"), which is a specific nucleic acid sequence at 
which replication is initiated. 

7. Selectable and Screenable Markers 

[01251 In certain embodiments of the invention, the cells contain nucleic acid 
construct of the present invention, a cell can be identified in vitro or in vivo by including a 
marker in the expression vector. Such markers would confer an identifiable change to the 
cell permitting easy identification of cells containing the expression vector. Genendly, a 
selectable marker is one that confers a property that allows for selection. A positive 
selectable marker is one in which the presence of the marker allows for its selection, while a 
negative selectable marker is one in which its presence prevents its selection. An example of 
a positive selectable marker is a drug resistance marker. 

[01261 Usually the inclusion of a drug selection marker aids in the cloning and 
identification of transformants, for example, genes that confer resistance to neomycin, 
puromycin, hygromycin, DHFR, GPT, zeocin and histidinol are usefiil selectable markers. In 
addition to markers conferring a phenotype that allows for the discrimination of 
transformants based on the implementation of conditions, other types of markers including 
screenable markers such as GFP or enhanced GFP, whose basis is colorimetric analysis, are 
also contemplated. Altematively, screenable enzymes such as herpes simplex virus 
thymidine kinase (tk) or chloramphenicol acetyltransferase (CAT) can be utilized. One of 
skill in the art would also know how to employ immunologic markers, possibly in 
conjunction with FACS analysis. Further examples of selectable and screenable markers are 
well known to one of skill in the art. 

B. Expression Systems 

[0127] Numerous expression systems exist that comprise at least a part or all of 
the compositions discussed above. Prokaryote- and/or eukaryote-based systems can be 
employed for use with the present invention to produce nucleic acid sequences, or their 
cognate polypeptides, proteins and peptides. Many such systems are commercially and 
widely available. 

[0128] The insect cell/baculovirus system can produce a high level of protein 
expression of a heterologous nucleic acid segment, such as described in U.S. Patent No. 
5,871,986, 4,879,236, both herein incorporated by reference, and which can be bought, for 
example, under the name MAXBAC® 2.0 fi-om INVITROGEN® and BACPACK™ 
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BACULOVIRUS EXPRESSION SYSTEM FROM CLONTECH®. Other examples of 
expression systems are well known in the art. 
V. Nucleic Acid Detection 

[0129] In addition to their use in directing the expression of a/o/ia/-associated 
proteins, polypeptides and/or peptides, the nucleic acid sequences disclosed herein have a 
variety of other uses. For example, they have utility as probes or primers or in any of the 
methods for embodiments involving nucleic acid hybridization, amplification of nucleic acid 
sequences, detection of nucleic acids, and other assays. A skilled artisan is aware of the 
following patents regarding details of these methods: U.S. Patent No. 5,840,873; U.S. Patent 
No. 5,843, 640; U.S. Patent No. 5,843,650; U.S. Patent No. 5,843,651; U.S. Patent No. 
5,843,663; U.S. Patent No. 5,846,708; U.S. Patent No. 5,846,709; U.S. Patent No. 5,846,717; 
U.S. Patent No. 5,846,726; U.S. Patent No. 5,846.729; U.S. Patent No. 5,846,783; U.S. Patent 
No. 5,849,481; U.S. Patent No. 5,849,483; U.S. Patent No. 5,849,486; U.S. Patent No. 
5,849,487; U.S. Patent No. 5,849,497; U.S. Patent No. 5,849,546; U.S. Patent No. 5,849,547; 
U.S. Patent No. 5,851,770; U.S. Patent No. 5,851,772; U.S. Patent No.5,853,990; U.S. Patent 
No. 5,853, 993; U.S. Patent No. 5,853,992; U.S. Patent No. 5.856,092; U.S. Patent No. 
5,858,652; U.S. Patent No. 5,861,244; U.S. Patent No.5,863,732; U.S. Patent No. 5,863,753; 
U.S. Patent No. 5,866,331; U.S. Patent No. 5,866,336; U.S. Patent No. 5,866,337; U.S. Patent 
No. 5,900,481; U.S. Patent No. 5,905,024; U.S. Patent No. 5,910,407; U.S. Patent No. 
5,912,124; U.S. Patent No. 5,912,145; U.S. Patent No. 5,912,148; U.S. Patent No. 5,916,776; 
U.S. Patent No. 5,916,779; U.S. Patent No. 5,919,626; U.S. Patent No. 5.919,630; U.S. Patent 
No. 5,922, 574; U.S. Patent No. 5,925,517; U.S. Patent No. 5,925,525; U.S. Patent No. 
5,928.862; U.S. Patent No. 5,928,869; U.S. Patent No. 5,928,870; U.S. Patent No. 5,928,905; 
U.S. Patent No. 5,928.906; U.S. Patent No. 5,929,227; U.S. Patent No. 5,932,413; U.S. Patent 
No. 5,932,451; U.S. Patent No. 5,935.791; U.S. Patent No. 5,935,825; U.S. Patent No. 
5,939,291; U.S. Patent No. 5,942,391; European Application No. 320 308; European 
Application No. 329 822; GB Application No. 2 202 328; PCT Application No. 
PCT/US87/00880; PCT Application No. PCT/US89/01025; PCT Application WO 88/10315; 
PCT Application WO 89/06700; and PCT Application WO 90/07641 . 
VL Kits 

[0130] All of the essential materials and/or reagents required for detecting a 
sequence selected from SEQ ID N0:1 through SEQ ID NO:66 in a sample can be assembled 
together in a kit This generally will comprise a probe or primers designed to hybridize 
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Specifically to individual nucleic acids of interest in the practice of the present invention, 
such as the nucleic acid sequences in SEQ ID N0:1 through SEQ ID NO:66. Also included 
can be enzymes suitable for amplifying nucleic acids, including various polymerases (reverse 
transcriptase. Tag, etc.), deoxynucleotides and buffers to provide the necessary reaction 
mixture for amplification. Such kits can also include enzymes and other reagents suitable for 
detection of specific nucleic acids or amplification products. Such kits generally will 
comprise, in suitable means, distinct containers for each individual reagent or enzyme as well 
as for each probe or primer pair. 

VIL i4to/ia/-Associated Nucleic Acids and Uses Thereof 

10131] A nucleic acid can be purified on polyacrylamide gels, cesium chloride 
centrifiigation gradients, or by any other means known to one of ordinary skill in the art (see 
for example, Sambrook et al. 1989, incorporated herein by reference). 

[01321 The temi "nucleic acid" will generally refer to at least one molecule or 
strand of DNA, RNA or a derivative or mimic thereof, comprising at least one nucleobase, 
such as, for example, a naturally occurring purine or pyrimidine base found in DNA (e.g. 
adenine "A," guanine "G," thymine "T" and cytosine "C") or RNA (e.g. A, G, uracil "U" and 
C). The term "nucleic acid" encompass the terms "oligonucleotide" and "polynucleotide." 
The term "oligonucleotide" refers to at least one molecule of between about 3 and about 100 
nucleobases in length. The term "polynucleotide" refers to at least one molecule of greater 
than about 100 nucleobases in length. These definitions generally refer to at least one single- 
stranded molecule, but in specific embodiments will also encompass at least one additional 
strand that is partially, substantially or fiiUy complementary to the at least one single-stranded 
molecule. Thus, a nucleic acid can encompass at least one double-stranded molecule or at 
least one triple-stranded molecule that comprises one or more complementary strand(s) or 
"complement(s)" of a particular sequence comprising a strand of the molecule. As used 
herein, a single stranded nucleic acid can be denoted by the prefix "ss", a double stranded 
nucleic acid by the prefix "ds", and a triple stranded nucleic acid by the prefix "ts." 

[0133] Thus, the present invention also encompasses at least one nucleic acid that 
is complementary to a atonaZ-associated nucleic acid. In particular embodiments the 
invention encompasses at least one nucleic acid or nucleic acid segment complementary to 
the nucleic acid sequences set forth in SEQ ID N0:1 through SEQ ID NO:66, of those which 
are nucleic acid sequences. Nucleic acid(s) that are "complementary" or "complement(s)" are 
those that are capable of base-pairing according to the standard Watson-Crick, Hoogsteen or 



wo 03/047532 



PCT/US02/41458 



reverse Hoogsteen binding complementarity rules. As used herein, the term 
"complementary" or "complement(s)" also refers to nucleic acid(s) that are substantially 
complementary, as can be assessed by the same nucleotide comparison set forth above. The 
temi "substantially complementary" refers to a nucleic acid comprising at least one sequence 
of consecutive nucleobases, or semiconsecutive nucleobases if one or more nucleobase 
moieties are not present in the molecule, are capable of hybridizing to at least one nucleic 
acid strand or duplex even if less than all nucleobases do not base pair with a counterpart 
nucleobase. 

[0134] Herein certain embodiments, a "gene" refers to a nucleic acid that is 
transcribed. As used herein, a "gene segment" is a nucleic acid segment of a gene. In certain 
aspects, the gene includes regulatory sequences involved in transcription, or message 
production or composition. In particular embodiments, the gene comprises transcribed 
sequences that encode for a protein, polypeptide or peptide. In other particular aspects, the 
gene comprises an atonal-associeXed nucleic acid, and/or encodes an a/owa/-associated 
polypeptide or peptide coding sequences. In keeping with the terminology described herein, 
an "isolated gene" can comprise transcribed nucleic acid(s), regulatory sequences, coding 
sequences, or the like, isolated substantially away from other such sequences, such as other 
naturally occurring genes, regulatory sequences, polypeptide or peptide encoding sequences, 
etc. In this respect, the term "gene" is used for simplicity to refer to a nucleic acid 
comprising a nucleotide sequence that is transcribed, and the complement thereof. In 
particular aspects, the transcribed nucleotide sequence comprises at least one functional 
protein, polypeptide and/or peptide encoding unit. As will be understood by those in the art, 
this function term "gene" includes both genomic sequences, RNA or cDNA sequences or 
smaller engineered nucleic acid segments, including nucleic acid segments of a non- 
transcribed part of a gene, including but not limited to the non-transcribed promoter or 
enhancer regions of a gene. Smaller engineered gene nucleic acid segments can express, or 
can be adapted to express using nucleic acid manipulation technology, proteins, polypeptides, 
domains, peptides, fusion proteins, mutants and/or such like. 

[01351 In certain embodiments, the nucleic acid sequence is a nucleic acid or 
nucleic acid segment. As used herein, the term "nucleic acid segment", are smaller fragments 
of a nucleic acid, such as for non-limiting example, those that encode only part of the atonal- 
associated peptide or polypeptide sequence. Thus, a "nucleic acid segment" can comprise 
any part of the a/o/ia/-associated gene sequence(s), of from about 2 nucleotides to the full 
length of the a/onaZ-associated peptide or polypeptide encoding region. In certain 
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embodiments, the "nucleic acid segment" encompasses the full length otonaZ-associated 
gene(s) sequence. In particular embodiments, the nucleic acid comprises any part of the SEQ 
ID N0:1 through SEQ ID NO:66, of from about 2 nucleotides to the foil length of the 
sequence disclosed in SEQ ID NO:l through SEQ ID NO:66. 

[0136] In certain embodiments, the nucleic acid segment can be a probe or primer. 
As used herein, a "probe" is a nucleic acid utilized for detection of another nucleic acid and is 
generally at least about 10 nucleotides in length. As used herein, a "primer" is a nucleic acid 
utilized for polymerization of another nucleic acid is generally at least about 10 nucleotides in 
length, A non-limiting example of this would be the creation of nucleic acid segments of 
various lengths and sequence composition for probes and primers based on the sequences 
disclosed in SEQ ID N0:1 through SEQ ID NO:66, of those which are nucleic acid 
sequences. 

[0137] The nucleic acid(s) of the present invention, regardless of the length of the 
sequence itself, can be combined with other nucleic acid sequences, including but not limited 
to, promoters, enhancers, polyadenylation signals, restriction enzyme sites, multiple cloning 
sites, coding segments, and the like, to create one or more nucleic acid construct(s). As used 
herein, a "nucleic acid construct" is a recombinant molecule comprising at least two segments 
of different nucleic acid sequence. The overall length can vary considerably between nucleic 
acid constructs. Thus, a nucleic acid segment of almost any length can be employed, with the 
total length preferably being limited by the ease of preparation or . use in the intended 
recombinant nucleic acid protocol. 

[01381 In certain embodiments, the nucleic acid construct is a recombinant vector. 
As used herein, a "recombinant vector" is a nucleic acid comprising multiple segments of 
nucleic acids utihzed as a vehicle for a nucleic acid sequence of interest. In certain aspects, 
the recombinant vector is an expression cassette. As used herein, an expression cassette is a 
segment of nucleic acid which comprises a gene of interest which can be transferred between 
different recombinant vectors by means well known in the art. 

[0139] In particular embodiments, the invention concerns one or more 
recombinant vector(s) comprising nucleic acid sequences that encode an ato/iaZ-associated 
protein, polypeptide or peptide that includes within its amino acid sequence a contiguous 
amino acid sequence in accordance with, or essentially as set forth in, SEQ ID NO:2 through 
SEQ ID NO:66, of which sequences are amino acid sequences, corresponding to Homo 
sapiens or Mus musculus a/o/ia/-associated sequence.. In other embodiments, the invention 
concerns recombinant vector(s) comprising nucleic acid sequences from other species that 
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encode an atowaZ-associated protein, polypeptide or peptide that includes within its amino 
acid sequence a contiguous amino acid sequence in accordance with, or essentially as set 
forth in SEQ ID N0:2 through SEQ ID NO:66, of which sequences are amino acid 
sequences. In particular aspects, the recombinant vectors are DNA vectors. 

[0140] It will also be understood that amino acid sequences or nucleic acid 
sequences can include additional residues, such as additional N- or C-terminal amino acids or 
5' or 3* sequences, or various combinations thereof, and yet still be essentially as set forth in 
one of the sequences disclosed herein, so long as the sequence meets the criteria set forth 
above, including the maintenance of biological protein, polypeptide or peptide activity where 
expression of a proteinaceous composition is concerned. The addition of terminal sequences 
particularly applies to nucleic acid sequences that can, for example, include various non- 
coding sequences flanking either of the 5' and/or 3* portions of the coding region or can 
include various internal sequences, z.e., introns, which are known to occur within genes. 

[0141] It will also be understood that this invention is not limited to the particular 
nucleic acid or ammo acid sequences of SEQ ID NO: through SEQ ID NO:66, of which 
sequences are amino acids. Recombinant vectors and isolated nucleic acid segments can 
therefore variously include these coding regions themselves, coding regions bearing selected 
alterations or modifications in the basic coding region, and they can encode larger 
polypeptides or peptides that nevertheless include such coding regions or can encode 
biologically functional equivalent proteins, polypeptide or peptides that have variant amino 
acids sequences. 

[0142] The nucleic acids of the present invention encompass biologically 
functional equivalent a/o/iaZ-associated proteins, polypeptides, or peptides or atonal- 
associated proteins, polypeptides or polypeptides. Such sequences can arise as a consequence 
of codon redundancy or functional equivalency that are known to occur naturally within 
nucleic acid sequences or the proteins, polypeptides or peptides thus encoded. Alternatively, 
functionally equivalent proteins, polypeptides or peptides can be created via the application 
of recombinant DNA technology, in which changes in the protein, polypeptide or peptide 
structure can be engineered, based on considerations of the properties of the amino acids 
being exchanged. Changes designed by man can be introduced, for example, through the 
application of site-directed mutagenesis techniques as discussed herein below, e.g., to 
mtroduce improvements or alterations to the antigenicity of the protein, polypeptide or 
peptide, or to test mutants in order to examine a/o/taZ-associated protein, polypeptide or 
peptide activity at the molecular level. 
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[0143] Fusion proteins, polypeptides or peptides can be prepared, e.g., where the 
atonal associated coding regions are aligned within the same expression unit with other 
proteins, polypeptides or peptides having desired functions. Non-limiting examples of such 
desired functions of expression sequences include purification or immunodetection purposes 
for the added expression sequences, e,g., proteinaceous compositions that can be purified by 
affinity chromatography or the enzyme labeling of coding regions, respectively EP 266,032, 
or via deoxynucleotide H-phosphonate intermediates as described by Froehler et al., Nucl. 
Acids Res., 14:5399-5407, 1986, 

[0144] As used herein an "organism" can be a prokaryote, eukaryote, virus and 
the like. As used herein the term "sequence" encompasses both the terms "nucleic acid" and 
"proteancecous" or "proteanaceous composition." As used herein, the term "proteinaceous 
composition" encompasses the terms "protein", "polypeptide" and "peptide." As used herem 
"artificial sequence" refers to a sequence of a nucleic acid not derived from sequence 
naturally occurring at a genetic locus, as well as the sequence of any proteins, polypeptides or 
peptides encoded by such a nucleic acid. A "synthetic sequence", refers to a nucleic acid or 
proteinaceous composition produced by chemical synthesis in vitro, rather than enzymatic 
production in vitro (i.e. an "enzymatically produced" sequence) or biological production in 
vivo (i.e. a "biologically produced" sequence). 
VIIL Cancer Therapies 

[0145] " ■ Given that the present invention is directed to methods and compositions 
for the treatment of abnoraial cell proliferation, a discussion of therapies of cancer, which is 
the state of abnormal cell proliferation, is warranted. In a specific embodiment, a 
gastrointestinal condition is a cancer, and the cancer is treated with standard therapies in 
addition to treatments described herein. 

[0146] A wide variety of cancer therapies, such as radiotherapy, surgery, 
chemotherapy and gene therapy, are known to one of skill in the art, can be used regarding 
the methods and compositions of the present invention. 

A. Radiotherapeutic agents 

[0147] Radiotherapeutic agents and factors include radiation and waves that 
induce DNA damage for example, g-irradiation. X-rays, UV-irradiation, microwaves, 
electronic emissions, radioisotopes, and the like. Therapy can be achieved by irradiating the 
localized tumor site with the above described forms of radiations. 
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[0148] Dosage ranges for X-rays range from daily doses of 50 to 200 roentgens 
for prolonged periods of time (3 to 4 weeks), to single doses of 2000 to 6000 roentgens. 
Dosage ranges for radioisotopes vary widely, and depend on the half-life of the isotope, the 
strength and type of radiation emitted, and the uptake by the neoplastic cells. 

B. Surgery 

101491 Surgical treatment for removal of the cancerous growth is generally a 
standard procedure for the treatment of tumors and cancers. This attempts to remove the 
entire cancerous growth. However, surgery is generally combined with chemotherapy and/or 
radiotherapy to ensure the destruction of any remaining neoplastic or malignant cells. Thus, 
surgery can be used in the context of the present invention. 

C. Chemotherapeutic Agents 

[0150] These can be, for example, agents that directly cross-link DNA, agents that 
intercalate into DNA, or agents that lead to chromosomal and mitotic aberrations by affecting 
nucleic acid synthesis. 

[0151] Agents that directly cross-link nucleic acids, specifically DNA, are 
envisaged and are shown herein, to eventuate DNA damage leading to a synergistic 
antineoplastic combination. Agents such as cisplatin, and other DNA alkylating agents can 
be used. 

[0152] Agents that damage DNA also include compounds that interfere with DNA 
replication, mitosis, and chromosomal segregation. Examples of these compounds include 
adriamycin (also known as doxorubicin), VP- 16 (also known as etoposide), verapamil, 
podophyllotoxin, and the like. Widely used in clinical setting for the treatment of neoplasms 
these compounds are administered through bolus injections intravenously at doses ranging 
from 25-75 mg/m2 at 21 day intervals for adriamycin, to 35-100 mg/m2 for etoposide 
intravenously or orally. 

[0153] Cancer therapies also include a variety of combination therapies with both 
chemical and other types of treatments. Chemotherapeutics include, for example, cisplatin 
(CDDP), carboplatin, procarbazine, mechlorethamine, cyclophosphamide, camptothecin, 
ifosfamide, melphalan, chlorambucil, busulfan, nitrosurea, dactinomycin, daunorubicin, 
doxorubicin, bleomycin, plicomycin, mitomycin, etoposide (VP 1 6), tamoxifen, raloxifene, 
estrogen receptor binding agents, taxol, gemcitabien, navelbine, famesyl-protein tansferase 
inhibitors, transplatinum, 5-fluorouracil, vincristin, vinblastin and methotrexate, or any 
analog or derivative variant of the foregoing. 
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D. Gene Therapy Administration 

[01541 For gene therapy, a skilled artisan would be cognizant that the vector to be 
utilized must contain the gene of interest operatively linked to a promoter. For antisense gene 
therapy, the antisense sequence of the gene of interest would be operatively linked to a 
promoter. One skilled in the art recognizes that in certain mstances other sequences such as a 
3' UTR regulatory sequences are useful in expressing the gene of interest. Where 
appropriate, the gene therapy vectors can be formulated mto preparations in solid, semisolid, 
liquid or gaseous forms in the ways known in the art for their respective route of 
administration. Means known in the art can be utilized to prevent release and absorption of 
the composition until it reaches the target organ or to ensure timed-release of the 
composition. A pharmaceutically acceptable form should be employed which does not 
mefifectuate the compositions of the present invention. In pharmaceutical dosage forms, the 
compositions can be used alone or in appropriate association, as well as in combination, with 
other pharmaceutically active compounds. A sufficient amount of vector containing the 
therapeutic nucleic acid sequence must be administered to provide a pharmacologically 
effective dose of the gene product. 

[0155] One skilled in the art recognizes that different methods of delivery can be 
utilized to administer a vector into a cell. Examples include: (1) methods utilizing physical 
means, such as electroporation (electricity), a gene gun (physical force) or applying large 
volumes of a liquid (pressure); and (2) methods wherein said vector is complexed to another 
entity, such as a liposome, viral vector or transporter molecule. 

[0156] Accordingly, the present invention provides a method of transferring a 
therapeutic gene to a host, which comprises administering the vector of the present invention, 
preferably as part of a composition, using any of the aforementioned routes of administration 
or alternative routes known to those skilled in the art and appropriate for a particular 
application. Effective gene transfer of a vector to a host cell in accordance with the present 
invention to a host cell can be monitored in ternis of a therapeutic effect (e.g. alleviation of 
some symptom associated with the particular disease being treated) or, further, by evidence 
of the transferred gene or expression of the gene within the host (e.g., using the polymerase 
chain reaction in conjunction with sequencing, Northem or Southern hybridizations, or 
transcription assays to detect the nucleic acid in host cells, or using immunoblot analysis, 
antibody-mediated detection, mRNA or protein half-life studies, or particularized assays to 
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detect protein or polypeptide encoded by the transferred nucleic acid, or impacted in level or 
function due to such transfer). 

[01571 These methods described herein are by no means all-inclusive, and further 
methods to suit the specific application will be apparent to the ordinary skilled artisan. 
Moreover, the effective amount of the compositions can be fiirther approximated through 
analogy to compoxmds known to exert the desired effect. 

[01581 Furthermore, the actual dose and schedule can vary depending on whether 
the compositions are administered in combination with other pharmaceutical compositions, or 
depending on interindividual differences in pharmacokinetics, drug disposition, and 
metabolism. Similarly, amounts can vary in in vitro applications depending on the particular 
cell line utilized (e.g., based on the number of vector receptors present on the cell surface, or 
the ability of the particular vector employed for gene transfer to replicate in that cell line). 
Furthermore, the amount of vector to be added per cell will likely vary with the length and 
stability of the therapeutic gene inserted in the vector, as well as also the nature of the 
sequence, and is particularly a parameter which needs to be determined empirically, and can 
be altered due to factors not inherent to the methods of the present invention (for instance, the 
cost associated with synthesis). One skilled in the art can easily make any necessary 
adjustments m accordance with the exigencies of the particular situation. 

[0159] It is possible that cells containing the therapeutic gene can also contam a 
suicide gene {i.e., a gene which encodes a product that can be used to destroy the cell, such as 
heipes simplex virus thymidine kinase). In many gene therapy situations, it is desirable to be 
able to express a gene for therapeutic purposes in a host cell but also to have the capacity to 
destroy the host cell once the therapy is completed, becomes uncontrollable, or does not lead 
to a predictable or desirable result. Thus, expression of the therapeutic gene in a host cell can 
be driven by a promoter although the product of said suicide gene remains harmless in the 
absence of a prodrug. Once the therapy is complete or no longer desired or needed, 
administration of a prodrug causes the suicide gene product to become lethal to the cell. 
Examples of suicide gene/prodrug combinations which can be used are Herpes Simplex 
Virus-thymidine kinase (HSV-tk) and ganciclovir, acyclovir or FIAU; oxidoreductase and 
cycloheximide; cytosine deaminase and 5-fluorocytosine; thymidine kinase thymidilate 
kinase (Tdk::Tmk) and AZT; and deoxycytidine kinase and cytosine arabinoside. 

[01601 The method of cell therapy can be employed by methods known in the art 
wherein a cultured cell containing a copy of a nucleic acid sequence or amino acid sequence 
of Mathl is introduced. 
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[01611 In yet another embodiment, the secondary treatment is a secondary gene 
therapy in which a second therapeutic polynucleotide is administered before, after, or at the 
same time a first therapeutic polynucleotide encoding all of part of an atonaZ-associated 
polypeptide. Delivery of a vector encoding either a full length or partial atonaZ-associated 
polypeptide in conjunction with a second vector encoding another gene product will have a 
combined anti-hyperproliferative effect on target tissues. Alternatively, a single vector 
encoding both genes can be used. 

E. Immunotherapy 

[0162] Immunotherapeutics, generally, rely on the use of immune effector cells 
and molecules to target and destroy cancer cells. The immune effector can be, for example, 
an antibody specific for some marker on the surface of a tumor cell. The antibody alone can 
serve as an effector of therapy or it can recruit other cells to actually effect cell killing. The 
antibody also can be conjugated to a drug or toxin (chemotherapeutic, radionuclide, ricin A 
chain, cholera toxin, pertussis toxin, etc.) and serve merely as a targeting agent. 
Alternatively, the effector can be a lymphocyte carrying a surface molecule that interacts, 
either directly or indirectly, with a tumor cell target. Various effector cells include cytotoxic 
T cells and NK cells. 

[0163] Immunotherapy, thus, could be used as part of a combined therapy, in 
conjunction with Ad-a/o/jc2/-associated gene therapy. The general approach for combined 
therapy is discussed below. Generally, the tumor cell must bear some marker that is 
amenable to targeting, i.e., is not present on the majority of other cells. Many tumor markers 
exist and any of these can be suitable for targeting in the context of the present invention. 
Common tumor markers include carcinoembryonic antigen, prostate specific antigen, urinary 
tumor associated antigen, fetal antigen, tyrosinase (p97), gp68, TAG-72, HMFG, Sialyl 
Lewis Antigen, MucA, MucB, FLAP, estrogen receptor, laminin receptor, eib B and pi 55. 
IX. Combination Treatments 

[0164] It can be desirable in utilizing the present invention to combine the 
compositions with other agents effective in the treatment of hyperproliferative disease, such 
as anti-cancer agents. An "anti-cancer" agent is capable of negatively affecting cancer in a 
subject, for example, by killing cancer cells, inducing apoptosis in cancer cells, reducing the 
growth rate of cancer cells, reducing the incidence or number of metastases, reducing tumor 
size, inhibiting tumor growth, reducing the blood supply to a tumor or cancer cells, 
promoting an inmiune response against cancer cells or a tumor, preventing or inhibiting the 
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progression of cancer, or increasing the lifespan of a subject with cancer. More generally, 
these other compositions would be provided in a combined amount effective to kill or inhibit 
proliferation of the cell. This process can involve contacting the cells with the expression 
construct and the agent(s) or multiple factor(s) at the same time. This can be achieved by 
contacting the cell with a single composition or pharmacological formulation that includes 
both agents, or by contacting the cell with two distinct compositions or formulations, at the 
same time, wherein one composition includes the expression construct and the other includes 
the second agent(s). 

[0165] Tumor cell resistance to chemotherapy and radiotherapy agents represents 
a major problan in clinical oncology. One goal of current cancer research is to find ways to 
improve the efficacy of chemo- and radiotherapy by combining it with gene therapy. For 
example, the herpes simplex-thymidine kinase (HS-tK) gene, when delivered to brain tumors 
by a retroviral vector system, successfully induced susceptibility to the antiviral agent 
ganciclovir (Culver, et aL, 1992). In the context of the present invention, it is contemplated 
that an atonal-associated gene therapy could be used similarly in conjunction wifli 
chemotherapeutic, radiotherapeutic, or immunotherapeutic intervention, in addition to other 
pro-apoptotic or cell cycle regulating agents. 

[0166] Alternatively, the gene therapy can precede or follow the other agent 
treatment by intervals ranging from minutes to weeks. In embodiments where the other agait 
and expression construct are applied separately to the cell, one would generally ensure that a 
significant period of time did not expire between the time of each delivery, such that the 
agent and expression construct would still be able to exert an advantageously combined effect 
on the cell. In such instances, it is contemplated that one can contact the cell with both 
modalities within about 12-24 h of each other and, more preferably, within about 6-12 h of 
each other. In soihe situations, it can be desirable to extend the time period for treatment 
significantly, however, where several d (2, 3, 4, 5, 6 or 7) to several wk (1, 2, 3, 4, 5, 6, 7 or 
8) lapse between the respective administrations. 

[01671 Various combinations can be employed, gene therapy is "A" and the 
secondary agent, such as radio- or chemotherapy, is "6": 

A/B/A B/A/B B/B/A A/A/B A/B/B B/A/A A/B/B/B B/A/B/B 

B/B/B/A B/B/A/B A/A/B/B A/B/A/B A/B/B/A B/B/A/A 
B/A/B/A B/A/A/B A/A/A/B B/A/A/A A/B/A/A A/A/B/A 
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[0168] Administration of the therapeutic expression constructs of the present 
invention to a patient will follow general protocols for the administration of 
chemotherapeutics, taking into account the toxicity, if any, of the vector. It is expected that 
the treatment cycles would be repeated as necessary. It also is contemplated that various 
standard therapies, as well as surgical intervention, can be applied in combination with the 
described hyperproliferative cell therapy. 

A. Inhibitors of Cellular Proliferation 

[0169] The tumor suppressor oncogenes function to inhibit excessive cellular 
proliferation. The inactivation of these genes destroys their inhibitory activity, resulting in 
unregulated proliferation. The tumor suppressors p53, pl6 and C-CAM are specific 
embodiments utilized in the present invention. Other genes that can be employed according 
to the present invention include Rb, APC, DCC, NF-1, NF-2, WT-1, MEN-I, MEN-H, zacl, 
p73, VHL, MMACl / PTEN, DBCCR-1, FCC, rsk-3, p27, p27/pl6 fusions, p21/p27 fusions, 
anti-thrombotic genes (e.g., COX-1, TFPI), PGS, Dp, E2F, ras, rayc, neu, raf, erb, fins, trk, 
ret, gsp, hst, abl, ElA, p300, genes involved in angiogenesis (e.g., A^GF, FGF, 
thrombospondin, BAI-1, GDAIF, or their receptors) and MCC. 

B. Regulators of Programmed Cell Death 

[0170] Apoptosis, or progranuned cell death, is an essential process for normal 
embryonic development, maintaining homeostasis in adult tissues, and suppressing 
c^cinogenesis (Kerr er a/., 1972). The Bcl-2 family of proteins and ICE-like proteases have 
been demonstrated to be important regulators and effectors of apoptosis in other systems. 
The Bcl-2 protein, discovered in association with follicular lymphoma, plays a prominent role 
in controlling apoptosis and enhancing cell survival in response to diverse apoptotic stimuli 
(Bakhshi et aL, 1985; Cleary and Sklar, 1985; Cleary et ai, 1986; Tsujimoto et ai, 1985; 
Tsujimoto and Croce, 1986). The evolutionarily conserved Bcl-2 protein now is recognized 
to be a member of a family of related proteins, which can be categorized as death agonists or 
death antagonists. Different family members have been shown to either possess similar 
fiinctions to Bcl-2 (e.g., BclXL, BclW, BclS, Mcl-l, Al, Bfl-1) or counteract Bcl-2 fimction 
and promote cell death (e.g., Bax, Bak, Bik, Bim, Bid, Bad, Harakiri). 

C. Other agents 

10171] It is contemplated that other agents can be used in combination with the 
present invention to improve the therapeutic efficacy of treatment. These additional agents 
include immunomodulatory agents, agents that affect the upregulation of cell surface 
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receptors and GAP junctions, cytostatic and differentiation agents, inhibitors of cell adhesion, 
or agents that increase the sensitivity of the hyperproliferative cells to apoptotic inducers. 
X. Dosage and Formulation 

[01721 The amino acid sequences, nucleic acid sequences, stem cells and/or 
regulatory factors (all active ingredients) of this invention can be foraiulated and 
administered to treat a variety of disease states by any means that produces contact of the 
active ingredient with the agent's site of action in the body of an animal They can be 
administered by any conventional means available for use in conjunction with 
pharmaceuticals, either as individual therapeutic active ingredients or in a combination of 
therapeutic active ingredients. They can be administered alone, or with a pharmaceutically 
acceptable carrier selected on the basis of the chosen route of administration and standard 
pharmaceutical practice. 

[0173] The dosage administered will be a therapeutically effective amount of 
active ingredient and will, of course, vary depending upon known factors such as the 
pharaiacodynamic characteristics of the particular active ingredient and its mode and route of 
administration; age, sex, health and weight of the recipient; nature and extent of symptoms; 
kind of concurrent treatment, frequency of treatment and the effect desired. 

[0174] The active ingredient can be administered orally in solid dosage forms 
such as capsules, tablets and powders, or in liquid dosage forms such as elixirs, syrups, 
emulsions and suspensions. The active ingredient can also be formulated for administration 
parenterally by injection, rapid infusion, nasopharyngeal absorption or dermoabsorption. The 
agent can be administered intramuscularly, intravenously, or as a suppository. In addition, 
parenteral solutions can contain preservatives such as benzalkonium chloride, methyl- or 
propyl-paraben and chlorobutanol. Suitable pharmaceutical carriers are described in 
Remington's Pharmaceutical Sciences, a standard reference text in this field. 

[0175] Additionally, standard pharmaceutical methods can be employed to control 
the duration of action. These are well known in the art and include control release 
preparations and can include appropriate macromolecules, for example polymers, polyesters, 
polyamino acids, polyvinyl, pyrolidone, ethylenevinylacetate, methyl cellulose, 
carboxymethyl cellulose or protamine sulfate. The concentration of macromolecules as well 
as the methods of incorporation can be adjusted in order to control release. Additionally, the 
agent can be incorporated into particles of polymeric materials such as polyesters, polyamino 
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acids, hydrogels, poly (lactic acid) or ethylenevinylacetate copolymers. In addition to being 
incorporated, these agents can also be used to trap the compound in microcapsules. 

[01761 Useful pharmaceutical dosage forms for administration of the compounds 
of this invention can be illustrated as follows. 

[0177J Capsules: Capsules are prepared by filling standard two-piece hard gelatin 
capsulates each with a therapeutically effective amount of powdered active ingredient, 175 
milligrams of lactose, 24 milligrams of talc and 6 milligrams magnesium stearate. 

[0178] Soft Gelatin Capsules: A mixture of active ingredient in soybean oil is 
prepared and injected by means of a positive displacement pump into gelatin to form soft 
gelatin capsules containing a therapeutically effective amount of the active ingredient. The 
capsules are then washed and dried. 

[0179] Tablets: Tablets are prepared by conventional procedures so that the 
dosage unit is a therapeutically effective amount of active ingredient. 0.2 milligrams of 
colloidal silicon dioxide, 5 milligrams of magnesium stearate, 275 milligrams of 
microcrystalline cellulose, 11 milligrams of cornstarch and 98.8 milligrams of lactose. 
Appropriate coatings can be applied to increase palatability or to delay absorption. 

[0180] Injectable: A parenteral composition suitable for administration by 
injection is prepared by stirring 1.5% by weight of active ingredients in 10% by volume 
propylene glycol and water. The solution is made isotonic with sodium chloride and 
sterilized — ^ 

[0181] Suspension: An aqueous suspension is prepared for oral administration so 
that each 5 millimeters contain a therapeutically effective amount of finely divided active 
ingredient, 200 milligrams of sodium carboxymethyl cellulose, 5 milligrams of sodium 
benzoate, 1.0 grams of sorbitol solution U.S.P. and 0.025 millimeters of vanillin. 

[0182] Accordingly, the pharmaceutical composition of the present invention can 
be delivered via various routes and to various sites in an animal body to achieve a particular 
effect (see, e.g., Rosenfeld et al. (1991), supra; Rosenfeld et aL, Clin. Res., 39(2), 31 lA 
(1991a); Jafife et al, supra; Berkner, supra). One skilled in the art will recognize that although 
more than one route can be used for administration, a particular route can provide a more 
immediate and more effective reaction than another route. Local or systemic delivery can be 
accomplished by administration comprising application or instillation of the formulation into 
body cavities, inhalation or insufflation of an aerosol, or by parenteral introduction, 
comprising intramuscular, intravenous, peritoneal, subcutaneous, intraderafial, as well as 
topical administration. 
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[0183] The composition of the present invention can be provided in unit dosage 
form wherein each dosage unit, e.g., a teaspoonful, tablet, solution, or suppository, contains a 
predetermined amount of the composition, alone or in appropriate combination with other 
active agents. The term "unit dosage form" as used herein refers to physically discrete units 
suitable as unitary dosages for human and animal subjects, each xmit containing a 
predetermined quantity of the compositions of the present invention, alone or in combination 
with other active agents, calculated in an amount sufficient to produce the desired effect, in 
association with a pharmaceutically acceptable diluent, carrier, or vehicle, where appropriate. 
The specifications for the unit dosage forais of the present invention depend on the particular 
effect to be achieved and the particular pharmacodynamics associated with the 
pharmaceutical composition in the particular host. 

[01841 These methods described herein are by no means all-inclusive, and further 
methods to suit the specific application will be apparent to the ordinary skilled artisan. 
Moreover, the effective amount of the compositions can be further approximated through 
analogy to compoxmds known to exert the desired effect. 

[01851 The following examples are offered by way of example, and are not 
intended to limit the scope of the invention in any manner. 

EXAMPLE 1 
MOVSE ATONAL HOMOLOG 1 (MATHl) 

[0186] It has been found that the present methods for the treatment of the hearing 
impaired have failed to address the problem directly, that is, the regeneration of auditory hair 
cell populations. The present invention in a preferred embodiment is directed to a member of 
the bHLH family, the Mathl gene or another a/o/ia/-associated nucleic acid sequence, and its 
requirement for generation of cerebellar granule neurons and inner ear hair cells. This 
discovery has wide ramifications not only for understanding neurodevelopment but also for 
therapies for a variety of prevalent disorders, as described below. 

[01871 The mouse atonal homolog 1 (Mathl) is expressed in the precursors of the 
cerebellar granule neurons; a few cells in the dorsal portion of the developing spinal cord; the 
inner ear; Merkel cells (touch receptors on the skins); and joints. Overexpressing Mathl in 
an otherwise differentiated cell can induce the fonnation or differentiation into a progenitor 
or mature inner ear hair cell-like cell. 

[01881 Mathl expression in the precursors of the cerebellar granule neurons 
suggests it is required for function in the cerebellum and brain. The cerebellum is essential 
for fine motor coordination and posture, and its dysfimction disrupts balance, speech and 
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limb movements. Cerebellar development typically begins at about embryonic day 9.5 (E9,5) 
when a small group of cells in the hindbrain proliferates and migrates rostrally to form the 
external granule layer, brain stem, and pontine neurons. This population of neuronal 
progenitors, which continues to express Mathl, further proliferates and migrates intemally to 
form the cerebellar granule neurons that are the predominant neuronal population in the 
cerebellum and brain. Mice that do not express Mathl completely lack cerebellar granule 
neurons and their precursors. Mathl is thus essential for the generation of these neurons and 
endows the very sparse population of neurons at E9.5 with the ability to proliferate into 
billions and then differentiate (Ben-Arie et al, 1997). Both these functions are of great 
medical significance. To understand normal proliferation provides necessary insight into 
abnormal proliferation, as observed in cancer. Cerebellar tumors of the primitive 
neuroectodermal type (e.g., meduUoblastoma) are the most common solid malignancy in 
children. Mathl expressing cells contribute significantly to these tumors. 

[01891 Mathl is expressed in the non-ossified joint cartilage (see Figure 6) that 
typically degenerates in osteoarthritis. This is the most prevalent form of arthritis, with 90% 
of people over 40 showing some degree of osteoarthritis in one or more joints. Given the 
properties of Mathl in cellular generation and proliferation, its artificial expression in 
affected joints can allow regeneration of the cells that constitute non-ossified cartilage. 

[0190] Disclosed herein are compositions and methods for the use of the Mathl 
gene, its human homolog (Hathl) or any of its homologs, orthologs, chimeric fusion proteins 
or derivatives of any suitable a/a/ia/-associated nucleic acid sequence or any another atonal- 
associated nucleic acid sequence. To learn about the functions of Mathl in manmials, the 
Mathl gene was deleted fi-om a mouse using a strategy that permitted detection of cells that 
express Mathl. Disclosed are the creation and characterization of mice that can be used to 
screen for compoimds which could be utilized to decrease or augment Mathl expression in 
inner ear hair cells and other cells in which Mathl expression is associated. 

[01911 Methods are also disclosed for the study, characterization and treatment of 
neoplastic proUferation of neuroectodermal origin since Mathl expression is essential for the 
generation and proliferation of cerebellar granule neurons. Also, it has been discovered that 
Mathl plays a role in the development of cells that produce non-ossified joint cartilage, 
which are associated with the development of osteoarthitis. These discoveries have led to a 
method of screening for compounds that can be helpful for the treatment of inner ear hair cell 
loss and other diseases that occur due to the functional loss of Mathl ^ such as osteoarthritis. 
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[0192] More particularly, the present invention also provides an animal 
heterozygous for Mathl gene inactivation or an another ato/ifl/-associated nucleic acid 
sequence, wherein at least one Mathl allele or another aronaZ-associated nucleic acid 
sequence has been replaced by insertion of a heterologous nucleic acid sequence, wherein the 
inactivation of the Mathl or otonaZ-associated sequence prevents expression of the Mathl or 
aro«a/-associated allele. The mouse can be further used to generate mice homozygous for 
Mathl or another aZo/ia/-associated sequence gene inactivation and can further include a 
second heterologous nucleic acid sequence, wherein at least one of the heterologous genes is 
used to detect expression driven by the Mathl or ato«a/-associated sequence regulatory 
elements. The complete or partial inactivation of the functional Mathl or ato/iaZ-associated 
sequence can be detected in, e.g., proprioreceptory cells, granule neurons and their progenitor 
cells, or non-ossified cartilage cells. 

[0193] Examples of heterologous nucleic acid sequences are reporter sequences 
such as b-galactosidase, green fluorescent protein (GFP), blue fluorescent protein (BFP), 
neomycin, kanamycin, luciferase, p-glucuronidase and chloramphenicol transferase (CAT). 
The Mathl or atonaZ-associated sequence can also be replaced under the control of 
regulatable promoter sequences or can be a tissue-specific promoter sequences. Said ^ 
promoter sequences can be partial or can contain the entire promoter. 

[0194] The present invention can also be used as, or as part of, a method for 
screening for a compound, wherein the administration of the compound affects a 
developmental and/or pathological gastrointestinal condition wherein said condition is a 
result of reduction in expression of the Mathl or ato/iaZ-associated sequence, the method 
including, administering the compound to a transgenic mouse that is homozygous for Mathl 
or flto/iaZ-associated sequence inactivation, wherein at least one Mathl or otowaZ-associated 
allele is inactivated by insertion of a heterologous nucleic acid sequence, wherein the 
inactivation of the Mathl or a/o/iaZ-associated sequence prevents expression of the Mathl or 
a/o/JflZ-associated gene, and monitoring the mouse for a change in the developmental and/or 
pathological gastrointestinal condition. As used herein, the screen provides for a compound 
that by upregulating expression of a heterologous nucleic acid sequence is a positive effector 
and for a compound that by downregulating expression of a heterologous nucleic acid 
sequence is a negative effector. 

(01 95] Yet another embodiment of the present invention is a method of promoting 
mechanoreceptive cell growth, that includes contacting a cell with a Mathl or atonal- 
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associated protein or gene in an amount effective to cause said cell to express an inner ear 
hair cell marker. An example of a hair cell marker for use with the method is cahretinin. The 
cell can be contacted with a vector that expresses a Mathl or oto/iaZ-associated nucleic acid 
sequence or amino acid sequence. Mathl or atonaZ-associated nucleic acid sequence- 
expressing recombinant vectors can include an adenoviral vector, a retroviral vector, an 
adeno-associated vector, a plasmid, a liposome, a protein, a lipid, a carbohydrate and a 
combination thereof of said vectors. Mathl or atonal-associaXed sequence can be imder the 
control of, e.g., a cytomegalovirus IE promoter sequence or the cytomegalovirus IE promoter 
sequence and a SV40 early polyadenylation signal sequence, or any other combination of 
appropriate promoter sequences, enhancer sequence, and polyadenylation. 

[0196] Furthemiore, a method is disclosed for treating hearing impairment or an 
imbalance disorder that includes administering to an animal, including a human, with hearing 
loss or an imbalance disorder a therapeutically effective amount of a Mathl or atonal- 
associated amino acid sequence or nucleic acid sequence. The hearing or balance impairment 
can be complete or partial and can affect either one ear or both ears. In a preferred 
embodiment, there is a substantial impairment of hearing. Hearing and an imbalance disorder 
can be affected separately or concomitantly in an animal to be treated, and said hearing 
and/or an imbalance disorder could be as a result of trauma, disease, age-related condition, or 
could be due to loss of hair cells for any reason. 

[0197] The present invention is also directed to a composition that includes a 
Mathl or a^onaZ-associated protein or gene in combination with a delivery vehicle, wherein 
the delivery vehicle causes a therapeutically effective amount of Mathl or a^onaZ-associated 
sequence to be delivered into a cell The delivery vehicle can be further defined as a vector 
that comprises a Mathl or ato/iaZ-associated amino acid sequence or nucleic acid sequence in 
an animal cell. The vector can be a retroviral or an adenoviral vector or any other nucleic 
acid based vector, which can even be dispersed in a pharmacologically acceptable 
formulation, and used for intralesional administration. The composition can even be a 
partially or fully purified protein that is delivered using a liposome, a protein, a lipid or a 
carbohydrate that promotes the entry of a Mathl or aronaZ-associated protein into a cell 
Examples of proteins that can be used as delivery vehicles include the receptor-binding 
domains (the non-catalytic regions) of bacterial toxins, such as, e.g., Exotoxin A, cholera 
toxin and Ricin toxin or protein transduction domains, such as firom the HIV TAT protein 
(Schwarze et al, 1999) (see Example 22). The composition for delivering Mathl can be a 
fusion protein. 
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[0198] A skilled artisan is aware that methods to treat animals as disclosed in the 
invention can be either in utero or after birth. Treatment can be given to an embryo and can 
occur either ex vivo or in vivo. 

EXAMPLE 2 
ANIMAL MODEL FOR ORGANOGENESIS 

[01991 An effective animal model for deficiency in a gene that controls 
organogenesis will most often have both alleles stably inactivated so that, throughout 
embryogenesis, one or more tissues cannot revert to a fiinctional wild-type allele. One 
method of generating animals with an altered genotype is gene targeting (Mansour et al, 
1993), in which homologous recombination of newly introduced DNA sequence the 
targeting sequence or construct) and a specific targeted DNA sequence residing in the 
chromosome results in the insertion of a portion of the newly introduced DNA sequence into 
the targeted chromosomal DNA sequence. This method is capable of generating animals of 
any desired genotype, and is especially usefiil for gene disruption (/.e., to "knock out") at a 
specific chromosomal gene sequence by inserting a selectable marker into the gene or 
completely replacing the gene with another nucleotide sequence. 

[0200] To knock out a genomic sequence, a cloned fi-agment must be available 
and intron-exon boundaries within the firagment defined (Mansour et al, 1993). Typically, 
the targeting construct contains a selectable marker such as Neo (neomycin resistance, see 
Mansour et al, 1993) flanked by sequences homologous to the chroniosomal target DNA, 
and beyond one of these flanking sequences the herpes simplex virus thymidine kinase gene 
(HSV-TK, see generally, McKnight et a/., 1980). The targeting construct is introduced, e.g.. 
by electroporation, into embryo-derived stem (ES) cells where homologous recombmation 
results in an insertion of the Neomycin resistance marker (Neo), but not the HSV-TK gene, 
into the targeted chromosomal DNA sequence. The altered ES cells are neomycin resistant 
and HSV-TK- and so are able to grow in the presence of both 0418 and gancyclovir 
antibiotics. Random insertions contain the HSV-TK gene and are thus sensitive to 
gancyclovir (Mansour, et al). Positive ES clones are then microinjected into blastocysts to 
generate germ-line chimeric mice, which are then bred to obtain progeny that are 
homozygous for the knock out gene. Such general methods of generating knock out animals 
have been demonstrated using mice. Genes in other animals such as rats, guinea pigs, 
gerbils, hamsters, and rabbits, can also be used as long as sufficient DNA sequence data are 
available to make an appropriate targeting construct to knock out the gene of interest. 
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[0201] Although ato and Mathl share a high degree of sequence conservation, 
there was an apparent discrepancy between their expression patterns and the consequences of 
their loss of function. Whereas ato is expressed primarily in the PNS of the fly and its 
absence causes loss of almost all CHOs (Jarman et al, 1993), Mathl is expressed in the CNS 
and its loss leads to absence of cerebellar granule neurons, the largest neuronal population in 
the CNS (Ben-Arie et a/., 1997). To better understand the functional relations between ato 
and Mathl, the present invention describes generation of a second Mathl null allele in mice 
{Mathl^'^^^'^^) by replacement of the Mathl coding region with a p-galactosidase gene 
(lacZ) and performing a subsequent search for CNS expression of ato in the fruit fly. The 
Examples describe a functional link between ato and Mathh ato is expressed in the fly brain, 
and lacZ expression under the control of Mathl regulatory elements {MathlllacZ) not only 
replicated the known expression pattern in the CNS (/.e., the neural tube, spinal cord and 
cerebellum), but appeared in many other cells of the murine PNS. Overexpression of Mathl 
in Drosophila caused ectopic CHO foraiation, providing further evidence that ato and Mathl 
are functionally conserved. 

[0202] The connections and consistency of the relationship between atonal m 
Drosophila and Mathl in the mouse suggests that their use as model systems in the art is 
justified. A family of homologues have been cloned and analyzed in the mouse including 
MATHl, 2, 3, 4A, 4B, 4C and 5 (Azakawa et aL, 1995; Bartholoma and Nave, 1994; Ben- 
Arie et al., 1997; Ben-Arie et al, 1996, Fode et al., 1998; Ma et al, 1998; McCormick et al, 
1996; Shimizu et al, 1995; Takebayashi et aL, 1997). It has been suggested that Mathl and 
MathS are the only true ato homologues given their amino acid sequence criteria, sharing 
67% and 71% identity with the bHLH domain of ATO, respectively (Ben-Arie et a/., 2000). 
A Xenopus atonal homolog, Xathl has been ectopically expressed in Drosophila and shown 
to behave similarly to ato (Kim et a/., 1997). Furthermore, the ability of Mathl to induce 
ectopic CHO formation and to restore CHOs to ato mutant embryos (see Example 13) is 
strong evidence that Mathl, and particularly its basic domain, encodes lineage identity 
information not unlike that encoded by ato and that mammalian cells expressing Mathl are 
functionally similar and perhaps evolutionarily related to Drosophila cells that require ato. 
Thus, the similarities between atonal in Drosophila, Xathl in Xenopus and Mathl in the 
mouse indicate that these animals are comparable animal model systems. Furthermore, the 
widespread use of mice in particular as a model system for humans also suggests that it 
similarly would allow utilization of the invention in humans. 
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[0203] With advances in molecular genetics now standard in the art, sequences 
from humans and other species can be used interchangeably in a variety of organisms. For 
example, the rat inducible hsp70 gene was used to produce transgenic mice that 
overexpressed inducible hsp70, allowing organs from transgenic mice to be protected from 
ischemic injury (Marber et al J. Clin. Invest. 95:1446-1456 (1995)) due to the increase in rat 
hsp70. Sequences in other animals have been interchanged including between humans and 
rodents to develop rodent models to study human disease, i.e. neurodegenerative diseases. 
One such example is the expression of the human SCAl gene, which encodes ataxin-1, m 
mice (Burright, E. N. et al Cell 82:937-948 (1995)). Transgenic mice were generated 
expressing the human SCAl gene with either a normal or an expanded CAG tract. The data 
illustrated that the expanded CAG repeats were expressed in sufficient amounts in the 
Purkinje cells to produce degeneration and ataxia. This example illustrates that a mouse 
model can be established to study spinocerebellar ataxia type 1, which is an autosomal 
dominant inherited neurologic disorder. In addition to developing mouse models, Drosophila 
is a halhnark model system in the field. Warrick et al. (1999) produced transgenic flies 
which co-expressed human hsp70 and a human mutant polyglutamine (MJDtr-Q78). 
Expression of the human mutant polyglutamine MJDtr-Q78 alone in the flies resulted m the 
formation of large aggregates in neurons. However, co-expression with human hsp 70 
resulted in suppressed aggregation. These examples illustrate that interchangeability of genes 
is routine in the field of molecular genetics and model systems provide powerfiil tools to 
characterize gene fimction. 

EXAMPLE 3 
GENERATION OF TRANSGENIC MATHl MICE 

[0204] To detect subtle Mathl expression patterns not identified by RNA in situ 
hybridization, and thus further illuminate this gene's role during embryonic development, 
Mathl null alleles {Mathl^'^''^^'^^) were generated by replacing the Mathl coding region 
with p-galactosidase (•^'^^*). 

[0205] The targeting construct, containing a lacZ cassette and a PGK-neo cassette 
(Fig. 7A), was used to replace the Mathl coding region. To delete the entire coding region of 
Mathl, a targeting construct was generated that contained the 5' and 3' genomic flanking 
Augments as described previously (Ben-Arie et al, 1997) flanking a pSApgal/PGK-neo 
cassette (Friedrich and Soriano, 1991). The construct is designed so that lad expression is 



wo 03/047532 



driven by endogenous Math! control elements, while an independent PGK promoter drives 
the expression of the selectable marker neo. 

[0206] The construct was electroporated into ES cells and selection for neo was 
achieved with G418. Fourteen out of 76 (18%) clones underwent homologous 
recombination. Genotyping of ES cells, yolk sac and tail DNA was performed using 
Southern analysis of EcoR I digested DNA and probes previously described (Ben-Arie et al., 
1997). The targeting construct was electroporated into embryonic stem (ES) cells; 14/76 
(18%) clones exhibited correct homologous recombination at the Mathl locus (Fig. 7B). 

[0207] Three ES cell lines carrying the Mathl^'^'^^ allele wwe injected into host 
blastocysts to generate chimeric mice. Mathf'^ mice were identified and intercrossed to 
generate homozygotes (Fig. 7C). The Mathl deletion was confirmed by Southern analysis 
using both flanking and internal probes (Fig. 7A). 

[0208] Mathl^'^"^^'^''^ mice show all the phenotypic features rqjorted in the 
Mathl-'- mice (Ben-Arie et a/., 1997; 2000). 

EXAMPLE 4 
X-GAL STAINING. HISTOLOGICAL AND 
IMMUNOHISTOCHEMICAL ANALYSES 

[0209] Embryos were staged by vaginal plug, with the morning of the plug 
designated E0.5. Embryos were dissected out of the uterus, separated firom extraembryonic 
membranes, and placed in cold phosphate buffered saline (PBS). The embryos were then 
fixed in 4% paraformaldehyde (PFA) in PBS for 30 minutes, and washed in cold PBS. Yolk 
sacs or tails were collected before fixation for DNA extraction and genotyping. Equilibration 
to improve the penetrability of the staining reagents was performed in 0.02% NP40, 0.01% 
sodium deoxycholate in PBS for 10 minutes at room temperature. Whole mount staining 
with X-gal (Bonnerot and Nicolas, 1993) was performed for 16-24 hours at 30oC while 
shaking in the same equilibration buffer, which also contained 5mM potassium ferricyanide, 
5mM potassium ferrocyanide, and 40 mg/ml X-gal (dissolved in DMSO). When the desired 
intensity of staining was achieved, usually within 18 hours, embryos were washed in PBS, 
postfixed for 30 minutes in buffered formalin, serially dehydrated in 25, 50, and 70% ethanol, 
and stored at 4*C. 

[0210] For histological analysis embryos were fiirther dehydrated in 80, 90, and 
100% ethanol, treated in Histoclear (National Diagnostics), and embedded in Paraplast 
(Oxford Labware). Seven to 20 ]x.m sections were cut using in a microtome (Microme). 
Counterstaining was performed using nuclear fast red (Vector Laboratories). 
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Immunohistochemistry was performed as detailed previously (Ben-Arie et al, 1997). 
Antibodies: Anti-cytokeratin 18 (DAKO) 1:20; Anti-human Chromogranin A (DAKO) 
1 : 100; Anti-MATHl (see below) 1 :200. 

EXAMPLE 5 

EXPRESSION PATTERNS IN TRANSGENIC MATHl MICE 

[02111 As expected, P-Gal expression in the cerebellum and dorsal spinal cord is 
identical to that of Mathl, and interestingly, P-Gai is also expressed throughout the otic 
vesicle epithelia at El 2.5 and in the sensory epithelia of the utricle, saccule, semicircular 
canals, and cochlea at E14.5 and E15.5 (Figures lA and IB). Utricles were obtained from 
C57BU129SVEV mice. 

[02121 Gross morphological analysis of the inner ear of Math 1^'°^^ mice at 
El 8.5, one day before full gestation, revealed no obvious defects in overall structure and size 
compared with wild type (wt) littermates. The branches of the Vlllth cranial nerve were 
present and reached the epithelia, but degenerated due to absence of the hair cells. 

[0213] The sensory epithelia were examined in detail. The utricles and cochleas 
of wild-type, Matht'^'^\ and Mathl^ ^"^^ mice were excised to allow viewing of the 
sensory epithelia with Nomarski optics. Hair bundles were present in both organs of wild- 
type and heterozygotes, but were completely absent in Mathl null litter-mates. Scaiming 
electron microscopy (SEM) of the cochlea and vestibular organs confirmed the absence of 
hair bundles in null mice (Figures 2A through 2F). To determine whether lack of hair 
bundles reflects the absence of hair cells, cross-sections of the sensory epithelia of all inner 
ear organs using both light and transmission electron microscopy (LM and TEM, 
respectively) were examined (Figures 3A through 3F). LM and TEM were carried out as 
described previously (Lysakowski and Goldberg, 1997). Tissue preparation for SEM 
consisted of osmication (1% Os04 in cacodylate buffer), dehydration, critical-point drying, 
sputter-coating with gold, and examination in a JEOL 358 electron microscope. 

[0214] Light microscopy revealed that sensory epithelia in null mice are 
considerably thinner, lack the normal stratification of cell nuclei and stain uniformly, all of 
which are consistent with the absence of hair cells. TEM clearly distinguishes between hair 
cells and supporting cells in normal utricles: hair cells have hair bundles, less electron-dense 
cytoplasm, more apical nuclei, and no secretory granules (Figures 4A and 4B). The sensory 
epitheUa of the null mutants lack hair cells entirely but do have supporting cells with normal 
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appearance (RUsch et al, 1998), including electron-dense cytoplasm, basal nuclei, and 
secretory granules. However heterozygous Mathl mice retain hair cells. 

EXAMPLE 6 

EXPRESSION OF A HAIR CELL SPECTFIC MARKER 
IN TRANSGENIC MATHl MICE 

[02151 Lack of hair cells at Ei8.5 can be due to (1) lack of sensory cell 
progenitors, (2) the inability of progenitors to differentiate into hair cells, or (3) the inability 
of hair cells to maintain the differential states, as has been observed in the absence of the 
POU domain transcription factor Bm3c. The first possibility is unlikely because progenitors 
give rise to both hair cells and supporting cells. To evaluate the remaining possibilities, the 
expression of the hair cell specific marker, cah^tinin and myosin VI were examined. 
Calretinin is a member of the calcium binding family of proteins and is expressed in 
differentiating hair cells (prior to hair bundle formation) and mature inner ear and auditory 
hair cells, but not in supporting cells. Calretinin expression in Mathl^"^^ ^"^^ and wild-type 
mice was studied by immunofluorescense on coronal sections of E15.5, E16.5 and E18.5 
embryos (Figures 5A through 5F). 

10216] For immunofluorescence, embryos were fixed for 1.5 hours in 4% 
paraformaldehyde/PBS at 4*C, sunk through 15% sucrose/PBS for 5 hours then 30% 
sucrose/PBS overnight, and snap fix)zen in a 2-methylbutane dry ice bath. 14 \im sections 
were cut on a cryostat and mounted onto gelatin-coated slides. Sections were fixed onto 
slides by dipping for 10 minutes in Streck tissue fixative (Streck laboratories) and air drying. 
Sections were blocked in 30% normal goat serum and 0.3% triton X-100 in PBS for 1 hr at 
room temperature (RT). Rabbit anti-caketinin polyclonal antibody (Chemicon laboratories) 
was diluted 1:200 in blocking solution and incubated overnight on sections at 4°C. Sections 
were washed 3 times (20 minutes each) in Phosphate-Buffered Saline (PBS) at RT. The 
secondary antibody anti-rabbit antibody, Alexa 488 (Molecular Probes), was diluted 1:400 in 
blocking solution and used to detect calretinin. Sections were covered and incubated at RT 
for 2 hours before washing and mounting in Vectashield containing DAPI (Vector). For 
confocal microscopy, sections were treated with 25 ng/ml RNAse before counterstaining 
with 50 ng/ml of propidium iodide and mounted in Vectashield without DAPI. Stained 
sections were viewed under a Bio-Rad 1024 confocal microscope. 

[02171 Cahretinin-positive cells are clearly visible in the sensory epithelia of the 
semicircular canals and utricles of wild-type mice, but Mathl ^"^^^"^ embryos lack 
calretinin expression at all three states. Using the mouse model disclosed herein the present 
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inventors demonstrate that hair cells never develop within the sensory epithelia ofMathr 

mice. The presence of the tectorial and otolithic membranes (secreted in part by the 

supporting cells), together with the TEM results, suggests that the remaining cells in the 

sensory epitheUa of the Mathl^'^''^^'^''^ mice are functional supporting cells. 

EXAMPLE 7 
M4mi/JL^CZ EXPRESSION MIMICS MATHl 
EXPRESSION IN THE DEVELOPING CNS 

[02181 The developing cerebellum at E14.5 and postnatal day 0 (PO) in Matht'^ 
and Mathl ^'^^ ^ mice were analyzed by RNA in situ hybridization analysis. 

[0219] The analysis showed that the expression pattern of the lacZ gene faithfully 
reproduced the Mathl expression pattern observed by RNA in situ hybridization analysis 
shown previously (Akazawa et ai, 1995; Ben-Arie et aL, 1996) (Fig. 2A, B, E, G). 
Moreover, the cerebellar phenotype in Mathl P-8^"P-8^' mice (Fig. 8F and 8H) was identical to 
that observed in Mathl null mice (Ben-Arie et aL, 1997). At E14.5, the precursors of the 
EGL are present in the rhombic lip from which they migrate over the cerebellar anlage to 
populate the EGL (Fig. 8E). Mutant mice displayed far fewer of these cells than 
heterozygous mice (Fig. 8F). At PO, the neurons of the external granule layer (EGL) were 
completely lacking (Fig. 8H). 

[0220] MathlllacZ expression in the developing hind brain and spinal cord 
similarly reproduced the expression pattern of Mathl (Fig. 8C, 8D). The only notable 
difference between the expression patterns established by in situ hybridization and lacZ 
staining is that b-galactosidase expression persists in differentiating or migrating cells of the 
spinal cord because of the stability of the p-GAL protein (Fig. 8D). In summary, the neural 
tissue expression pattern and cerebellar phenotype associated with the replacement of the 
Mathl coding region by lacZ is consistent with previously published data on Mathl 
expression (Akazawa et aL, 1995; Ben-Arie et aL, 1997; Ben-Arie et aL, 1996; Hehns and 
Johnson, 1998), demonstrating that the endogenous control elements were not disrupted by 
insertion of the lacZ gene. Moreover, many previously undetected clusters of lacZ- 
expressing cells became apparent upon X-gal staining of whole embryos and sections in 
Mathl*'^ mice (see below). It is likely that limitations in the spatial resolution of RNA in 
situ hybridization techniques used to detect the transcript in earlier studies prevented these 
sites of expression from being discerned (Akazawa et aL, 1995; Ben-Arie et aL, 1996). 
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Alternatively, the stability of the lacZ gene product and the increased sensitivity due to signal 
amplification allowed us to identify sites of relatively low expression levels. 

EXAMPLE 8 

AfA THIILACZ IS EXPRESSED IN INNER EAR SENSORY EPITHELIA 

[0221] The sensory organs of the inner ear were among the newly identified sites 
of MathlllacZ expression, demonstrated utilizing the methods described in Example 2. 
Expression in the otic vesicle was first detected at El 2.5 and continued until El 8.5 
throughout much of the sensory epithelia (Bermingham et al, 1999) (Figure 9A, 9B). Null 
mutants displayed MathlllacZ expression in the inner ear throughout embryogenesis (Fig. 
9C). Mathl null mutants lack hair cells in all of the sensory organs (Bermingjiam et al, 
1999), but maintain supporting cells, the other sensory q)ithelia-derived cells (Fig. 9C). 
These supporting cells seem to be fimctional, based on their morphology and the presence of 
overlying membranes secreted in part by these cells. Although the expression oi Mathl in 
inner ear sensory epithelia was not demonstrated by RNA in situ hybridization analysis, the 
complete lack of inner ear hair cells in the null mutants leaves little doubt about the 
authenticity of the MathlllacZ expression pattern. 

[0222] Mathl is clearly essential for hair cell development in the inner ear. Its 
expression pattern and in vivo function are akin to those of Mathl's proneural homolog, 
atonal (ato) (A. P. Jarman, Y. Grau, L. Y. Jan, Y. N. Jan, Cell 73, 1307-21 (1994)). ato is 
expressed in a ring of epithelial cells within the antennal disc of Drosophila. Some of these 
epithelial cells will subsequently develop into mechanoreceptors in the Johnston organ, which 
is necessary for hearing and negative geotaxis. It is interesting to note that mechanoreceptor 
progenitor cells are absent in ato mutante, whereas only the mechanoreceptors, and not their 
progenitors, are absent in Mathl null mice. . 

[0223] Based on the observations made herein, the present inventors have 
recognized that Mathl is required for the specification of inner ear hair cells. In a sense, 
Mathl acts as a "pro-hair cell gene" in the developing sensory epithelia. In conjunction with 
two recent studies, the present inventors have recognized that the results provided herein 
provide evidence supporting a lateral inhibition model for the determination of hair cells and 
supporting cells (Haddon et al., 1998; Adam, et al., 1998), in which the interplay of Delta, 
Notch, and Serratel results in the selection of individual hair cells firom clusters of competent 
cells. Such a model entails that the sensory epithelia express a "pro-hair cell gene" whose 
function is essential for hair cell &te specification. 
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[0224] The ectopic expression of ato in the fruitfly and its homolog Xathl in 
Xenopus (Kim et al., 1997) can recruit epithelial cells into specific neuronal fates, and the 
expression of Mathl in inner ear epithelia strongly suggests loss of a functional Mathl gene 
is likely to be a common cause of deafiiess and vestibular dysfunction. 

EXAMPLE 9 

MATHl/LACZ IS EXPRESSED IN BRAIN STEM NUCLEI 

[02251 In the brainstem Mathl llacZ staining appeared from El 8.5 to P7 in the 

vaitral pons in the regions corresponding to the pontine nuclei (Fig. 9D and inset). This 

finding is consistent with the hypothesis of Akazawa and colleagues that Mart /-positive cells 

in the developing hind brain are precursors to the bulbopontine neurons (Akazawa et al., 

1995). No such staining appeared in the null mutants (Fig. 9E and inset). These data raise 

the possibility that the absence of lacZ staining in pontine nuclei can be due to failure of their 

precursors to migrate, proliferate, and/or differentiate. Ventral pontine nuclei were examined 

upon haematoxylin and eosin staining of sections and were found to be missing in the brain 

stem of null mice (Fig. 9F, G). Furthermore, the failure of null mouse newborns to breathe 

can be due to absence of these brainstem neurons. 

EXAMPLE 10 
MATHI/LACZIS EXPRESSED IN CHONDROCYTES 

[0226] Mathl*'^'^^ heterozygotes displayed expression of Mathl in articular 
cartilage (Figures 6A and 6B). Figure 6A demonstrates expression in all joints of a forelimb. 
Upon closer examination of an elbow jomt, Mathl is noted to be expressed exclusively in the 
non-ossified articular chondrocytes. 

[0227] Expression of Mathl /lacZ was detected in the developing proximal joints, 
such as those of the hip and shoulder, as early as E12.5 (Fig IDA). X-gal positive staining 
was detected at subsequent developmental stages in a progressive proximal-distal pattern that 
paralleled the normal development of joints (Figure lOB). In the joints, MathlllacZ 
expression immediately follows mesenchymal condensation, which begins at Ell .5. 
Condensed mesenchyme cells differentiate into chondrocytes (Bi et a/., 1999; Horton et al., 
1993; Karsenty, 1998). 

[0228] Chondrocytes differentiate in three major phases during bone formation: 
resting, proliferating and hypertrophic. The resting chondrocytes that populate the articular 
cartilage are referred to as articular chondrocytes (Buckwalter and Mankin, 1998; Poole, 
1997). Prior to birth, resting chondrocytes constitute the entire chondrocyte population in 
joints. To establish which cells expressed MathlllacZ, sections from E18.5 and P7 Mathl*^' 



wo 03/047532 



PCTAJS02/414S8 



mice were stained with X-gal. MathlllacZ is expressed in the resting chondrocytes of all 
joints analyzed at E18.5; resting chondrocytes in the elbow joint are shown in Figure IOC, 
and Fig. lOD shows the resting, proliferating, and articular chondrocytes of a P7 mouse. 

[0229] The joints of El 8.5 embryos were examined with anti-MATHl antibody 
prepared by the following methods. An EcoR I-Hind m fragment encoding the N-terminal 
156 amino acids of the Ma/Aiopen reading frame (MathlD) was cloned into the pET 28a+ 
expression vector (Novagen). MathlD fragment was expressed as a His tag fusion protein. 
Soluble MATHID priatein was purified according to His-tag kit specifications (Novagen) and 
2mg of protein were used to immimize Chickens (Cocalico Biologicals Inc.). 

[0230] Expression was found in resting chondrocytes, who-eas no expression was 
observed in null embryos. It should be noted that not all articular cartilage cells express 
MathlllacZ (Fig. lOE). MathlllacZ expression in Mathl null mutants is similar to that in 
heterozygous mice at El 8.5, suggesting that Mathl is not required for resting chondrocyte 
development. 

EXAMPLE 11 
MATffllLACZ IS EXPRESSED IN MERKEL CELLS 

(02311 By E14.5 MirA7//flcZ-positive cells were apparent around the vibrissae 
and in the skin of much of the body (Fig. lOB). In the trunk, the stained cells were arranged 
in a striped pattern defined by the epidermal ridges. This staining was apparent only in the 
hairy, not the glabrous, skin. All the primary (mystical) vibrissae, including the lateral nasal, 
maxillary and four large hairs, were positive for MathlllacZ. Staining was also detected in 
the secondary vibrissae, including the labial, submental, rhinal, and isolated orbital vibrissae 
(supra-, infra- and post-orbital) (Yamakado and Yohro, 1979). By E15.5 staining appeared in 
clusters of cells in the foot pads (Fig. lOB). 

[0232] To identify the Mathl llacZ-positive cells in the vibrissae, footpad, and 
hairy skin, histological sections from Mathl*'^"^ mice were examined (Fig. IIA-D). 
Sections through the vibrissae showed that the stained cells are localized to the more apical 
half of the hair shaft, but are not in the hair itself Cross sections through the foot pad 
illustrated staining of cluster of cells in the epidermal layer (Fig. 1 1 B, C). As shown in Fig. 
IID, sections through the truncal skin identified clusters of Mathl I lacZ-stmned cells. The 
stained cells were arranged in a horseshoe-shaped pattern centered within an elevated button- 
like structure in the hairy skin. These button-like structures were identified as touch domes 
or Haarscheiben (Pinkus, 1905), which are characterized by a thickened epidermis and an 
elevated dermal papilla with a capillary network. Touch domes are associated with large 
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guard hairs dispersed between other hair types in the coat. The spatial distribution of 
AfarAi//acZ-stained cells, the timing of their appearance at E14.5, and their localization 
within the mystical pads of the vibrissae and the touch domes in the hairy skin suggest that 
these cells correspond to Merkel cells, specialized cells in the epidermis that form slow- 
adapting type I mechanoreceptor complexes with neurites (Munger, 1991). 

[0233] The results of comparative analysis of the MathlAacZ expression pattern 

ft oal/ ft ><99l 

in heterozygous and homozygous E16.5 animals are shown in Figure 1 lE-L. Mathl ^'^ 
embryos displayed a staining pattern similar to that of Math t'^'^^ littermates in the vibrissae 
and footpads (Fig. 1 lE-G, I-K). In contrast, staining in the touch domes of the hairy skin was 
barely detectable in Mathl^'^^ embryos (Fig. 11H,L). The reduction of staining in null 
animals was also obvious at E18.5. 

[02341 To further define Ma//j//7flcZ-positive cells in the skin, Matht'^'^ mice 
were mated to Tabby mice. Tabby (Ta) is a spontaneous X-linked mutation displaying a 
similar phenotype in hemizygous males and homozygous females (Ferguson et aL, 1997). 
Tabby mutants lack hair follicles (tylotrich), a subset of Merkel cells that are associated with 
touch domes in the hairy skin of the trunk (Vielkind et aL, 1995), and some of the five 
secondary vibrissae on the head (Gruneberg, 1971). Hence, in a cross of Ta/Ta females with 
a heterozygous Matht'^'^^ male, 50% of the male progeny are Ta/Y: Matht'^\ allowing 
us to assess whether the MathJ/lacZ-posi^ve cells correspond to Merkel cells. 

" [0235] Ta/Ta females were time-ihated with Mathl*' ^'^^ males, and embryos 
were harvested at E16.5. Each pup's gender was determined by PGR on tail DNA, using 
primers (forward 5' -TGAAGCTTTTGGCTTTGAG-S'; SEQ ID NO:67, and reverse 

[0236] 5 ' - CCGCTGCCAAATTCTTTGG-3 '; SEQ ID NO:68) that yielded a 320 
bp product from chromosome X, and a 300 bp product from chromosome Y (Liu et aL, 
1999). Amplification conditions were: 92''C/ 1 min, 55°C/1 min, 72''C/ 1 min for 32 cycles, 
with an initial denaturation step of 94''C/ 7 min and last extension step of 72°C/ 7 min. 
Amplification products were separated on 2% agarose gels. X-gal-stained embryos were 
scored independently by 2 individuals, and only then were results matched with the 
determined gender. 

[0237] Both Tabby females and males carrying the Mathl*'^'^ allele displayed X- 
gal staining in the vibrissae and foot pads (Fig. 12 A, B). The effect of the Tabby mutation on 
the nimiber of secondary vibrissae was quite clear: hemizygous males completely lacked 
Mflrt7/lacZ-positive cells in the secondary vibrissae (typically lacking in Ta mutants) and on 
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the trunk (Fig. 12E). Females that are heterozygous for Tabby showed patchy staining in the 

touch domes (although less than wt), as should be anticipated in female carriers of a mutation 

in a gene that undergoes random X chromosome inactivation (Fig. 12C, 12D). The 

localization and distribution of the positive cells, as well as their absence in selected vibrissae 

and the trunk of Tabby males, strongly indicate that Mathl is expressed in the Merkel cells 

associated with guard follicles in the touch domes of the hairy skin. 

[0238] To ascertain whether MathlllacZ staining pattern reflects normal Mathl 

expression pattern, inmiunohistochemical analysis of MATHl was performed on sections 

from abdominal skin (see Example 2). As seen in Fig. 13A and B, MATHl-positive cells 

were detected around the hair follicles oiMathf^'^ but not Mathl P-s^^P'Sai ^^^^ Antibodies 

against two Merkel cells markers were chosen for further analysis: anti-cytokeratinl8, 

expressed in simple epithelia, and chromogranin, localized to secretory granules of 

neuroendocrine, endocrine, and neuronal tissues. Both cytokeratin 18 (Fig. 13C,D) and 

chromogranin A (Fig. 13E, 13F) confirmed the identity of the Mathl /lacZ-positive cells as 

Merkel cells, but did not reveal staining abnormalities in Mathl^'^^^'^ mice. Thus, Mathl 

does not seem to be essential for the genesis of the neuroendocrine Merkel cells, in contrast 

to pure neuronal cell types like cerebellar EGL and pontine nuclei. 

EXAMPLE 12 
MATHl PARTIALLY RESCUES CHINESE HAMSTER 
OVARY CELLS (CHO^ IN FLIES DELETED FOR^rO 

[0239] This Example demonstrates that arowaZ-associated genes can induce the 
development of CNS cells in animals deficient in a native atonal-associated gene or gene 
product. This Example also demonstrates that arc?«a/-associated genes can therapeutically 
function in species in which they are not natively expressed. 

[0240] Given the remarkable similarity in expression patterns of ato and Mathl ^ 
and their identical basic domains, Mathl was tested to see if it would mimic the effects of ato 
overexpression by producing ectopic chordotonal organs as described by the following 
methods. Wild-type, also known as yw flies, were transformed with a UAS-Marti construct 
as described (Brand and Perrimon, 1993). To overexpress Mathl in wild type flies, yw; 
HAS'Mathl flies were mated to HS-Gal4 flies. The progeny were heat shocked as 
previously described (Jarman et a/., 1993). To rescue the loss of chordotonal organs in ato 
mutant flies, w; JJAS-Mathl/UAS-Mathl; atoirTM6 flies were crossed to w; HS-Gal4/CyO; 
atol/TM6 flies. Embryos were collected for 3 hr., aged for 3 hr., heat shocked for 30 min. at 
37'*C and allowed to develop for the next 12-15 hr. Embryos were fixed in 4% formaldehyde 
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in PBS with 50% heptane. Embryos were washed with 100% ethanol, transfenred to PBT and 
stained with mAb 22C10 as previously described (Kania et al., 1995) to detect PNS neurons. 
Chordotonal neurons were idratified by their distinct morphology and position. 

[0241] Expressing Mathl during pupal development by heat shock using the 
UAS-Gal4 system (Brand and Perrimon, 1993) resulted in supernumerary external soise 
organs on the notum (Fig. 14 A,B) and the wing blade, as reported for ato (Jarman et al., 
1993) and the Achaete-Scute complex (AS-C) genes (Brand and Perrimon, 1993; Rodriguez 
et al., 1990). Mathl expression in flies, like ato, produced ectopic chordotonal organs (Fig. 
8G), although with less efficiency. Overexpression of the AS-C genes does not, however, 
resuh in ectopic chordotonal organs (Jarman et al., 1993). Mathl thus has a similar 
functional specificity to ato. 

[0242] Since several ato enhancers are ato-dependent (Sun et al., 1998), they can 
be activated by Mathl, which would then lead to ectopic CHO specification. To determine 
whether Mathl can substitute for ato function in the fly, and to rule out the possibility that 
production of CHOs by Mathl is due to ato activation, Mathl was expressed in ato mutant 
embryos. The mutants lack all chordotonal neurons (Fig. 14C), but overexpressing Mathl 
partially rescues tiie loss of these neurons (Fig. 14D) in a manner similar to ato (Chien et al., 
1996). 

EXAMPLE 13 

SIGNIFICANCE OF ATONAL AND MATHl IN THE CNS AND PNS 
[0243] Over the past few years significant progress has been made towards 
unraveling the roles of bHLH proteins in vertebrate neurogenesis. Neural vertebrate bHLH- 
encoding genes were isolated and characterized because Drosophila homologues such as ato 
or the AS-C genes had been previously shown to be required for neurogenesis (Anderson, 
1995; Guillemot, 1046 1995; Lee, 1997; Takebayashi et al., 1997). Indeed, several genes 
were shown to be proneural because their absence caused a failure of neuroblast or sensory 
organ precursor (SOP) specification, whereas their overexpression lead to the recruitment of 
supernumerary neuronal precursors (Ghysen and Dambly-Chaudiere, 1989). With the 
exception of neurogcnin (Ngn) 1 and 2 (Fode et al., 1998; Ma et al., 1998), it remains 
uncertain which of the vertebrate homologues play roles similar to their Drosophila 
counterparts, and what precise role different bHLH proteins play in neural development. In 
Drosophila, ato is required for the development of a specific subset of sense organs, the 
chordotonal organs (Jannan et al., 1993). CHOs are internal mechanosensors of the PNS 
(Mclver, 1985). Thus, ato and the CHOs provide an excellent system in which to ascertain 
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not only the molecular and developmental relationship between invertebrate and vertebrate 
neurogenesis vis-^-vis the function of the proneural genes, but also the evolutionary 
conservation of sensory organ function and specification. Seven ato homologues have been 
cloned and analyzed in the mouse: Mouse Atonal Homologues (MATH) 1, 2, 3, 4A (also 
known as Ngn2), 4B (Ngn3), AC (Ngnl), and 5 (Akazawa et al, 1995; Bartholoma and Nave, 
1994; Ben-Arie et al, 1997, 1996; Fode et al, 1998; Ma et al, 1998; McCormick et al, 
1996; Shimizu et al, 1995; Takebayashi et al, 1997). Most are expressed during 
neurogenesis in both the CNS and PNS. These homologues vary in the degree of their 
sequence conservation, and can be divided into three groups. The most distantly related 
group, the neurogenins, includes Ngn 1, 2 and 3. These gene products share, on average, 53% 
identity in the bHLH domain with ATO. They are expressed largely in mitotic CNS and 
sensory ganglia progenitor cells. Recent work suggests that these genes can play a role in 
neuroblast determination, and can therefore be true proneural genes (Fode et al, 1998; Ma et 
al, 1998). The second group includes MATH2 and MATH3, which share 57% identity in the 
bHLH domain with ATO. These proteins have been postulated to function in postmitotic 
neural cells (Bartholoma and Nave, 1994; Shimizu et al, 1995). Math2 expression is 
confined to the CNS, while Math3 is expressed in both the CNS and the trigeminal and dorsal 
root ganglia. The third group includes MATHl and MATHS, arguably the only true ato 
homologues by amino acid sequence criteria, sharing 67% and 71% identity with the bHLH 
domain of ATO, respectively. It is noteworthy that both genes encode a basic domain 
identical to that of ATO. Interestingly, the basic domain of ATO was shown to be sufficient, 
in the context of another proneural protein (SCUTE), to substitute for the loss of ato function 
(Chien et al, 1996). Mathl was initially shown to be expressed in the precursors of the 
cerebellar EGL and in the dorsal spinal cord (Ben-Arie et al, 1997, 1996). Math5 is 
expressed in the dividing progenitors in the developing retina and in the vagal ganglion 
(Brown et al, 1998). With the exception of MathS expression in the neural retina, these 
observations pose a paradox: none of the vertebrate homologs appeared to be expressed in 
peripheral organs or tissues similar to those where ato is expressed. Jarman et al (1993) 
reported that ato is expressed in the CNS. In the examples described herein it is shown that, 
in addition to the iimer proliferation center of the optic lobe, ato is expressed in a small 
anteriomedial patch of cells in each brain lobe (Fig. 8F). Because it remains unclear, 
however, precisely what role ato plays in Drosophila CNS development, it has been difRcult 
to argue that ato and its vertebrate homologues display functional conservation. The 
experiments presented herein reveal sites of previously uncharacterized Mathl expression. As 
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expected, MathlllacZ expression in the CNS corresponds to that of Mathl, but Mathl is also 
expressed in the skin, the joints, and the inner ear, in striking parallel to ato expression in the 
fly. Moreover, the expression in the ear (sensory epithelium) and the skin (Merkel cells) is 
restricted to sensory structures whose function is to convert mechanical stimuli into neuronal 
electrochemical signals. It is important to point out that in Drosophila, ato appears to play 
two roles simultaneously. It is required not only to select the precursors of the CHOs 
(proneural role), but also to specify these precursors as CHO precursors (lineage identity role) 
(Jaraian and Ahmed, 1998; Jarman et a/., 1993). The specificity of Mathl expression in the 
periphery makes it tempting to speculate that it, too, can endow specific cells with very 
specific lineage identities to distinguish them fiinctionally firom other sensory structures. The 
ability of Mathl to induce ectopic CHO formation and to restore CHOs to ato mutant 
embryos supports the notion that Mathl, and particularly its basic domain, encodes lineage 
identity information not unlike that encoded by ato. This suggests that the mammalian cells 
expressing Mathl, at least in the ear and the skin, are fiinctionally similar and perhaps 
evolutionarily related to Drosophila cells that require ato. Furthermore, MathS expression in 
the neural retina suggests that the fimctions of atonal in the fly are canied out by two genes 
in the mouse: the development of some mechanoreceptors is under the control of Mathl and 
retinal development is possibly under the control of MathS. It is interesting to note that in the 
fiiUy sequenced nematode C. elegans, only one homolog of atonal, lin-32, was identified 
(Zhao and Emmons, 1995). Mutants with the u282 allele of lin-32 are touch-insensitive, 
which strengthens the argument for evolutionary conservation of atonal fimction in 
mechanoreception. The pattern of Mathl/lacZ expression in the pontine nuclei suggested this 
region should be carefiiUy evaluated in null mutants. Although no defects in the pons of 
Mathl null mice (Ben-Arie et al, 1997) were originally detected, closer analysis revealed the 
lack of pontine nuclei at this site. These neurons derive fi^m the rhombic lip (Altman and 
Bayer, 1996) as do the EGL neurons, which are also. lacking in Mathl null mice. While it is 
possible to draw parallels between Mathl and ato expression in the skin and ear, it is not 
clear that such is the case for the joints, ato expression in the fly joints is required for the 
foraiation of leg CHOs. In contrast, Mathl is expressed in resting and articular chondrocytes 
that do not have any described neural fijnction, and for which no parallels exist in the fly. It 
can be that Mathl expression in cartilage indicates a novel role for a mechanosensory gene, 
or it can simply reflect similarities in the molecular events underlying the development of the 
various Afa/A/ -expressing cell types. Alternatively, CHOs can also fimction as joint structural 
elements in the fly, or articular cartilage can have a mechanoreceptive or transducive capacity 
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yet to be described. There is no evidence at this point to support one or another of these 
possibilities. Analyzing the functions of ato and MathJ will enhance the understanding of 
neural development and the evolutionary conservation of sensory function. The sites and 
specificity of Mathl expression can make it suitable as a tool of gene therapy or gene 
activation approaches to illnesses such as hearing loss and osteoarthritis that are due to age- 
related or environmental damage. 

EXAMPLE 14 

ATONAL-ASSOCIATED NUCLEIC ACID DELIVERY USING ADENOVIRUS 

[0244] Human adenoviruses are double-stranded DNA tumor viruses with 
genome sizes of approximate 36 kb. As a model system for eukaryotic gene expression, 
adenoviruses have been widely studied and well characterized, which makes them an 
attractive system for development of adenovirus as a gene transfer system. This group of 
viruses is easy to grow and manipulate and they exhibit a broad host range in vitro and in 
vivo. In lytically infected cells, adenoviruses are capable of shutting off host protein 
synthesis, directing cellular machineries to synthesize large quantities of vural proteins, and 
producing copious amounts of virus. 

[0245] The El region of the genome includes ElA and ElB, which encode 
proteins responsible for transcription regulation of the viral genome, as well as a few cellular 
genes. E2 expression, including E2A and E2B, allows synthesis of viral replicative 
functions, e.g. DNA-binding protein, DNA polymerase, and a terminal protein that primes 
replication. E3 gene products prevent cytolysis by cytotoxic T cells and tumor necrosis 
factor and appear, to be important for viral propagation. Functions associated with the E4 
proteins include DNA replication, late gene expression, and host cell shutofif. The late gene 
products include most of the virion capsid proteins, and these are expressed only after most of 
the processing of a single primary transcript Gcom the major late promoter has occurred. The 
major late promoter (MLP) exhibits high efficiency during the late phase of the infection. 

[0246] As only a small portion of the viral genome appears to be required in cis, 
adenovirus-derived vectors offer excellent potential for the substitution of large DNA 
fiagments when used in connection with cell lines such as 293 cells. Ad5 -transformed 
human embryonic kidney cell lines have been developed to provide the essential viral 
proteins in trans. The inventors thus reasoned that the characteristics of adenoviruses 
rendered them good candidates for use in targeting Mathl deficient cells in vivo. In another 
embodiment these constructs include a Hathl or any o/onaZ-associated nucleic acid sequence. 
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[0247] Particular advantages of an adenovirus system for delivering foreign 
proteins to a cell include: (i) the ability to substitute relatively large pieces of viral DNA by 
foreign DNA; (ii) the structural stability of recombinant adenoviruses; (iii) the safety of 
adenoviral administration to humans; (iv) lack of any known association of adenoviral 
infection with cancer or malignancies; (v) the ability to obtain high titers of the recombinant 
viras; and (vi) the high infectivity of Adenovirus. 

[0248] One advantage of adenovirus vectors over retroviruses is a higher level of 
gene expression. Additionally, adenovirus replication is independent of host gene replication, 
unlike retroviral sequences. Because adenovirus transforming genes in the El region can be 
readily deleted and still provide efficient expression vectors, oncogenic risk from adenovirus 
vectors is thought to be negligible. 

[0249] In general, adenovirus gene transfer systems are based upon recombinant, 

engineered adenovirus that is rendered replication-incompetent by deletion of a portion of its 

genome, such as El, and yet still retains its competency for infection. Relatively large 

foreign proteins can be expressed when additional deletions are made in the adenovirus 

genome. For example, adenoviruses deleted in both El and E3 regions are capable of 

carrying up to 10 Kb of foreign DNA and can be grown to high titers in 293. Surprisingly, 

persistent expression of transgenes following adenoviral infection is possible. Use of the 

adenovirus gene transfer system can be more useful for the delivery of Mathl to cells in 

nascent or damaged cartilage in joints. In particular, the Mathl adenovirus can be used to 

deliver Mathl, and confer Mathl gene expression in, non-ossified joint cartilage that has 

been damaged as a consequence of osteoarthritis. 

EXAMPLE 15 
M4riyj- ADENOVIRUS CONSTRUCTS 

[0250] Recombinant virions for the controlled expression of Mathl can be 
constructed to exploit the advantages of adenoviral vectors, such as high titer, broad target 
range, efficient transduction, and non-integration in target cells for the transformation of cells 
into hair cells. In a specific embodiment these constructs include a Hathl or any atonal- 
associated nucleic acid sequence. In one embodiment of the invention, a replication- 
defective, helper-independent adenovirus is created that expresses wild type Mathl sequences 
under the control of the human cytomegaloviras promoter or the metallothionine promoter. 

[0251] Control functions on expression vectors are often provided from viruses 
when expression is desired in mammaUan cells. For example, conunonly used promoters are 
derived from polyoma, adenovirus 2 and simian virus 40 (SV40). The early and late 
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promoters of SV40 virus are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin of replication. Smaller or 
larger SV40 fragments can also be used provided there is included the approximately 250 bp 
sequence extending from the Hindm site toward the Bgll site located in the viral origin of 
replication. Further, it is also possible, and often desirable, to use promoter or control 
sequences normally associated with the Mathl gene sequence, namely the Mathl promoter, 
provided such control sequences are compatible with the host cell systems or the target cell- 
One such target cell is located in the inner ear of a human patient in need of inner ear hair 
cells. 

[0252] An origin of replication can be provided by construction of the vector to 
include an exogenous origin, such as can be derived from SV40 or other viral (e.g., polyoma, 
adeno, VSV, BPV) source, or can be provided by the host cell chromosomal replication 
mechanism. If the vector is integrated into the host cell chromosome, the latter is often 
sufficient. 

EXAMPLE 16 

ATONAL-ASSOCIATED NUCLEIC ACID DELIVERY USING RETROVIRUS 

[0253] Another approach for gene delivery capitalizes on the natural ability of 
viruses to enter cells, bringing their own genetic material with them. Retroviruses have 
promise as gene delivery vectors due to their ability to integrate their genes into the host 
genome, transferring a large amount of foreign genetic material, infecting a broad spectrum 
of species and cell types and because they are easily packaged in special cell-lines. 
Retroviruses can be particularly use&l for the delivery oi Mathl into inner ear hair cells that 
have reduced expression of Mathl, or that are in need of over-expression of Mathl . In 
another embodiment these constructs include a Hathl or any atonal-associated nucleic acid 
sequence. 

EXAMPLE 17 
MATHl RETROVIRAL CONSTRUCTS 

[0254] The Mathl open reading frame (ORF) was excised from pBluescript by an 
EcoR I-Xbal digest. The fragment was gel purified, and blunt ended using Klenow DNA 
polymerase. The retroviral vector pLNCX (purchased from CLONTECH) was linearized 
with Hpal, and ligated with the Mathl ORF fragment. The ligation was transformed into 
transformation competent E, coli cells. The resulting antibiotic resistant colonies were 
assayed for the presence of the correct construct. 
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[0255] The cloning, reproduction and propagation retroviral expression vectors 
are well known to those of skill in the art. One example of a retroviral gene transfer and 
expression system that has been used to express Mathl is the CLONTECH pLNCX, pLXSN 
and LAPSN expression vectors. For propagation of these vectors PT67 and EcoPack 
packaging cell lines can be used. For more information on mammalian cell culture, the 
following general references can be used: Culture of Animal Cells, Third Edition, edition by 
R. L Freshney (Wiley-Liss, 1993); and Current Protocols in Molecular Biology, ed. By F. M. 
Ausubel, et al, (Greene Publishing Associates and Wiley & Sons, 1994), relevant portions 
incorporated herein by reference. 

[0256] In another embodiment constmcts can be generated which include a Hathl 
or any afo/iaZ-associated nucleic acid sequence. 

EXAMPLE 18 
MAINTENANCE OF PACKAGING CELL LINES 

[0257] The maintenance of packaging cell lines, such as the 293 and PT67 
packaging cell lines, is described briefly. A vial of frozen cells is transferred from liquid N2 
to a 37° C water bath until just thawed. In order avoid osmotic shock to the cells, and to 
maximize cell survival, 1 ml of (Dulbecco's Modified Eagle Medium) DMEM is added to the 
tube and the mixture is transfenred to a 15-ml tube. Another 5 ml of DMEM is added and the 
cells are mixed. After repeating these steps the final volume in the tube should be about 12 
ml. Next, the cells are centrifuged at 500 x g for 10 min. Finally, the supernatant is removed 
and the cells are resuspended in maintenance media as described in the next step. Generally, 
the cells are maintained in DMEM (high glucose: 4.5 g/L) containing 10% Fetal Bovine 
Serum (FBS), and 4 mM L-glutamine. If desired or necessary, 100 U/ml penicillin/100 g/ml 
streptomycin can be added. It is recommended that are plated at 3-5 x 10^ per 100-mm plate 
and split every 2 to 3 days, when they reach 70-80% confluency (confluence is 3-4 x 10*^ per 
100-mm plate). The PT67 cell line, for example, has a very short doubling time (<16h) and 
should be split before they become confluent. The doubling time for EcoPack-293 cells is 
24-36 h. 

[0258] Cells are split be removing the medium and washing the cells once with 
PBS. After treatment with 1-2 ml of trypsin-EDTA solution for 0.5-1 min, 5 to 10 ml of 
media and serum is added to stop trypsinization. The cells are dispersed gently, but 
thoroughly, by pipetting and are resuspended. Alternatively, a predetemined portion of the 
cells is replated in a 100-mm plate in 10 ml of medium, followed by rotation or shaking of 
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the plate to distribute the cells evenly. A ratio of up to 1:20 for the PT67 or EcoPack-293 
cells is common. 

[02591 Generally, the percentage of PT67 or EcoPack-293 cells capable of 

packaging retroviral vectors decreases slowly with continued passage of the cell line. 

Therefore, packaging cells should be reselected after 2 months of growth in culture. 

Alternatively, new high-titer cells can be purchased from, e.g., CLONTECH, or low passage 

nimiber stocks can be frozen, stored and thawed to increase the viral yield. 

EXAMPLE 19 
METHODS UTILIZING A RETROVIRAL VECTOR 

[02601 The following protocol is used to transfect the retroviral vector for virus 
production, infection of target cells, and selection of stable clones. Other methods and 
vectors can also be used with the present invention to express Mathl, such as those described 
in Retroviruses, ed. by J. M. Coffin & H. E. Varmus (1996, Cold Spring Harbor Laboratory 
Press, NY) and Current Protocols in Molecular Biology, ed. by F. M. Ausubel et al. (1994, 
Greene Publishing Associates and Wiley & Sons), incorporated herein by reference. In 
another embodiment these constructs include a Hathl or any atonaZ-associated nucleic acid 
sequence. 

[02611 Briefly, the transfection of the retroviral vector into PT67 cells was as 
follows. Mathl was cloned into pLNX as described hereinabove. The packaging cells were 
plated to a density of 5-7 x lO' cells per 100-mm plate 12-24 hours before transfection. 1-2 
hours before transfection, the medium replace with fresh medium. 25 M chloroquine can be 
added just prior to transfection. Chloroquine increases transfection efficiency 2-3 fold. A 25 
mM stock solution of chloroquine can be made in distilled water and filter sterilized 

[02621 To each 100-mm plate 10-15 g of plasmid DNA using the desired method 
is transfected using, e.g., standard calcium-phosphate procedures (CalPhos Mammalian 
Transfection Kit, #K2050-1). The final volume of transfection mixture should not exceed 1 
ml. The transfection solution is added to the medium and the plate is rotated to ensure even 
distribution. About 8 hours after transfection, a glycerol shock treatment can be performed to 
increase the uptake of DNA. After 10 to 24 hours post-transfection the medium was removed 
and the cells were washed twice with PBS, before adding 5 ml DMEM containing 10% FBS. 
The culture was incubated for an additional 12-48 hours to allow increase in virus titer. The 
virus titer reaches a maximum 48 hours post-transfection and is generally at least 30% of 
maximum between 24 and 72 hours post-transfection. 
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[0263] Alternatively, a stable virus-producing cell lines can also be selected. To 
obtain stable virus-producing cell lines, the transfected packaging cells are plated in a 
selection medium 2-3 days post-transfection. For G418 selection of neomycin resistance, the 
cells are selected in the presence of G418 (0.5 mg/ml "active") for one week. Vectors 
carrying other selectable markers such as Puro, Bleo, or Hyg, can be used to obtain stable 
virus producing cell populations as well. Cell populations producing virions that produce 
titers of 105-106 recombinant virus particles per ml are common. Generally, lO'-lO* 
recombinant virus particles per ml is suitable for most purposes. For some studies, higher 
titer clones can be required. In this case, after antibiotic selection, individual clones are 
selected using, e.g., clone cylinders or limiting dilution, prior to propagation. Viral titer 
can be determined in a variety or ways, one such method is described hereinbelow. The viral 
titer produced by transiently transfected or stable virus-producing packaging cell lines is 
determined as follows, NIH/3T3 cells are plated one day prior to beginning the titer 
procedure. Cells are plated in 6-well plates at a density of 5 x 10*-1 x 10^ cells per well and 4 
ml of media are added per well. Viras-containing medium is collected from packaging cells, 
and polybrene is added to a final concentration of 4 g/ml. The medium is filter-sterilized 
through a 0.45- m filter. Polybrene is a polycation that reduces the charge repulsion between 
the virus and the cellular membrane. The filter should be cellulose acetate or polysulfonic 
(low protein binding) but not nitrocellulose. Nitrocellulose binds proteins in the retroviral 
membrane, and consequently destroys the virus. Serial dilutions are prepared as follows: six 
10-fold serial dilutions are usually sufficient. To dilute the virus, fresh medium containing 4 
g/ml of polybrene is utilized. Next, NIH/3T3 target cells are infected by adding virus- 
containing medium to the wells. After 48 hours, the NIH/3T3 cells are stained. The titer of 
virus corresponds to the number of colonies present at the highest dilution that contains 
colonies, multiplied by the dilution factor. For example, the presence of four colonies in the 
105 dilution would represent a viral titer of 4 x 10^. 

[0264] For the infection of cells, the following procedure was followed. The 
target cells were plated 12-18 hours before infection at a cell density of 3-5 x lO' per 100-mm 
plate. For the infection of cells that can be used for a biological assay, control cells can be 
treated with an insert-free virus produced under identical conditions. Half-maximal infection 
generally occurs after 5-6 hours of exposure of cells to virus, with maximal infection 
occurring after approximately 24 hours of exposure. The actual reverse transcription and 
integration of the retrovirus takes place within 24-36 hours of infection, depending on cell 
growth kinetics. Expression can be observed at 24 hours, and reaches a maximum at 
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approximately 48 hours. Alternatively, infections can be conducted sequentially, about 12 

hours apart. Sequential infection generally increases the efficiency of infection and also 

increases viral copy number. A minimum of 12 hours between each infection is 

recommended in order to ensure that cellular receptors will be unoccupied by viral envelope. 

EXAMPLE 20 
SCREENING ASSAYS 

[0265] Finally, the present invention also provides candidate substance screening 
methods that are based upon whole cell assays, in vivo analysis and transformed or immortal 
cell lines in which a rq)orter gene is employed to confer on its recombinant hosts a readily 
detectable phenotype that emerges only under conditions where Mathl would be expressed, 
is under-expressed or is over-expressed. Generally, reporter genes encode a polypeptide not 
otherwise produced by the host cell that is detectable by analysis, e.g., by chromogenic, 
fluorometric, radioisotopic or spectrophotometric analysis. In the present invention the 
Mathl gene has been replaced with p-galactosidase in a mouse. 

[0266] An example of a screwing assay of the present invention is presented 
herein. MathJ expressing cells are grown in microtiter wells, followed by addition of serial 
molar proportions of the small molecule candidate to a series of wells, and determination of 
the signal level after an incubation period that is sufficient to demonstrate, e.g., cabretinin 
expression in controls incubated solely with the vehicle used to resuspend or dissolve the 
compound. The wells containing varying proportions of candidate are then evaluated for 
signal activation. Candidates that demonstrate dose related enhancement of reporter gene 
transcription or expression are then selected for further evaluation as clinical therapeutic 
agents. The stimulation of transcription can be observed in the absence of expressed MathJ, 
in which case the candidate compound might be a positive stimulator of hair cell 
differentiation. Alternatively, the candidate compound might only give a stimulation in the 
presence of low levels of Mathl, which would suggest that it functions to stabilize tiie 
formation of Mathl dimers or the interaction of Mathl with one or more transcriptional 
factors. Candidate compounds of either class might be useful therapeutic agents that would 
stimulate production of inner ear hair cells and thereby address the need of patients with 
hearing loss or balance control impairments. 

EXAMPLE 21 

TRANSFECTION OF CELLS WITH MATHJ RETROVIRAL VECTORS 

[0267] The present invention provides recombinant host cells transformed or 
transfected with a polynucleotide that encodes Mathl, as well as transgenic cells derived 
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from those transformed or transfected cells. In another embodiment these constructs include 
a Hathl or any atonaZ-associated nucleic acid sequence. Preferably, a recombinant host cell 
of the present invention is transfected with a polynucleotide containing a functional Mathl 
nucleic acid sequence or a chimeric Mathl gene. Methods of transforming or transfecting 
cells with exogenous polynucleotides, such as DNA molecules, are well known in the art and 
include techniques such as calcium-phosphate- or DEAE-dextran-mediated transfection, 
protoplast fusion, electroporation, liposome mediated transfection, direct microinjection and 
adenovirus infection. 

[0268] Mathl expression using recombinant constructs can be used to target the 
delivery of Mathl to cells in need thereof Different promoter-vector combinations can be 
chosen by a person skilled in these arts to drive Mathl expression in different cell types. In 
some cases, the desired outcome can not be protein, but RNA, and recombinant vectors 
would include those with inserts present in either forward or reverse orientations. In addition, 
some vectors, for instance retroviruses or artificial recombination systems, can be designed to 
incorporate sequences within a cellular or viral genome in order to achieve constitutive or 
inducible expression of protein or RNA. 

[0269] Many of the vectors and hosts are available commercially and have 
specific features that facilitate expression or subsequent purification. For instance DNA 
sequences to be expressed as proteins often appear as fusion with unrelated sequences that 
encode polyhistidine tags, or HA, FLAG, myc and other epitope tags for immunochemical 
purification and detection, or phosphorylation sites, or protease recognition sites, or 
additional protein domains such as glutathione S-transferase (GST), maltose binding protein 
(MBP) (New England Biolabs), and so forth that faciUtate purification. Vectors can also be 
designed that contain elements for polyadenylation, splicing, and termination, such that 
incorporation of naturally occurring genomic DNA sequences that contain introns and exons 
can be produced and processed, or such that unrelated introns and other regulatory signals 
require RNA processing prior to production of mature, translatable RNAs. Proteins produced 
in the systems described above are subject to a variety of post-translational modifications, 
such as glycosylation, phosphorylation, nonspecific or specific proteolysis or processing. 

EXAMPLE 22 

DELIVERY OF MATHl AS AN AMINO ACID SEQUENCE 

[0270] A peptide (1 1 amino acids) derived fi^om HIV has been recently described 
that when fused to full length proteins and injected into mice allow a rapid dispersal to the 
nucleus of all cells of the body (Schwarze et al, 1999). Schwarze et al. made fusion proteins 
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to Tat ranging in size from 15 to 120 kDa. They documented a rapid uptake of the fusion 
proteins to the nuclei of cells throughout the animal, and the functional activity of said 
proteins was retained. 

[0271] In an embodiment of the present invention there are constructs containing 
the Tat or Tat-HA nucleic acid sequence operatively linked to a Mathl nucleic acid sequence. 
In another embodiment these constructs include a Hathl or any a/o/ia/-associated nucleic 
acid sequence. The vectors are expressed in bacterial cultures and the fusion protein is 
purified. This purified Tdt-Mathl protein or Tat-Hathl protein is injected into animal to 
detemiine the efficiency of the Tat delivery system into the inner ear, skin, cerebellum, brain 
stem, spinal cord and joints. Analysis is carried out to determine the potential of the Tat- 
MathirSdX-Hathl protein in hair cell and neuronal regeneration. This is a viable therapeutic 
approach either in its own right or in association with other methods or genes. 

[0272] It should be understood that the methods to screen for compounds which 
affect Mathl expression disclosed herein are useful notwithstanding that effective candidates 
can not be found, since it is of practical utility to know what upstream effector is necessary 
for Mathl transcription. 

EXAMPLE 23 

MATHI IS REQUIRED FOR SECRETORY CELL LINEAGE 
COMMITMENT IN INTESTINE 

[0273] Two null alleles were utiUzed for Mathl : Mathl''' (with the coding region 
replaced by Hprt) and Mathl^-°^^^^ (with the coding region replaced by the p-galactosidase 
gene, which is then expressed under the control of the Mathl promoter) (Ben-Arie et al, 
2000). Mathl null mice die shortly after birth, but Mathl heterozygous mice survive to 
adulthood and appear normal. It was previously shown that Mathl/LacZ expression faithfully 
mimics the endogenous gene expression (Ben-Arie et al., 2000). Herein Mfl/Ay^"^'^" was used 
instead of Mathl^'^^^'°^^ null mice for X-gal staining experiments to ensure equal copy 
numbers of the LacZ gene in heterozygous and null animals. Mathl/LacZ expression within 
the gut is restricted to the intestinal epithelium starting at El 6.5 and is sustained until 
adulthood. 

[0274] X-gal staining of adult intestines was performed as described 
(Sts^penbeck and Gordon, 2000); for embryos, 10-^mi sections from frozen blocks of 4% 
paraformaldehyde-fixed intestinal tissue were stained overnight at 37*C in a pH 8.0 solution 
containing 1.3 mM MgCb, 15 mM NaCl, 44 mM Hepes buffer (pH 7.3), 3 mM potassium 
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feiri-cyanide, 3 mM potassium ferrocyanide and 0.05% X-gal. Sections were counterstained 
with nuclear fast red. 

[0275] No Mathl /LdicZ expression was detected in the stomach, pancreas, or lung. 
In E18.5 heterozygous mice, LacZ-positive cells are sparsely scattered in the villi, the 
intervillus epithelium (Fig. 15 A), and colonic crypts (Fig. 15C). In Mathl null littermates, 
however, LacZ-expressing cells are clustered in the intervillus region of ileum (Fig. 15B) and 
at the bases of the colonic crypts (Fig. 15D). Mathl fL&cZ expression persists throughout 
duodenum, jejunum, ileum, and colon (Fig. 15, 15E and 15F; Fig. 16) in adult Mathl^' 
Gavp-Gai ^ ^^jj^ jj^g scattered blue cells appear to have a goblet cell morphology (a 
spherical vacuole); at the base of the crypt, most apical granule-containing Paneth cells 
^pear to be LacZ-positive. X-gal stained cells are also found in the mid-crypt region. LacZ 
expression in adult crypts suggests that Mathl helps initiate cytodifferentiation of the 
epitheUal cells. No MathlJL&cZ expression was detected in the enteric nervous system 
(intestinal) from E14.5 to adult. An acetylcholinesterase activity assay (Blaugrund et al., 
1996) revealed no gross abnormalities in the enteric neurons, although subtle deficits may not 
be q)parent at these resolutions. 

[02761 The small and large intestines of Mathl null embryos (E14.5 to El 8.5) 
showed normal villus architecture, lamina propria, and musculature, but no goblet cells (Fig. 
17, A and B). In wild-type animals Alcian blue-positive goblet cells increased in number 
along the duodenal-colonal axis (Fig. 17C), but were not detected in Mathl^'^'"' mice (Fig. 
17D). The enteroendocrine lineage in the gut epithelium was then analyzed. 

[0277] HematoxyUn and eosin or Alcian blue and neutral red staining and 
immunohistochemistry were performed according to standard protocols. The source and final 
dilution of the primary antibodies were as follows: rabbit chromogranin A antibody (1:2000), 
gastrin antibody (1:300), glucagon antibody (1:2000), serotonin antibody (1:20000), 
somatostatin antibody (1:4000), neurotensin antibody (1:2500) are from DiaSorin; rabbit 
synaptophysin antibody (1:200, Bio-Genex), and rabbit Ki-67 antibody (1:1000, Novocastra). 
For EM, different regions of El 8.5 intestines were fixed in 3% phosphate-buffered 
glutaraldehyde and post fixed in phosphate-buffered osmium tetroxide. Specimens were 
dehydrated and embedded in Araldyte 502 resin. Semithin sections (0.4 ^m) were stained 
with methylene blue and basic fiichsin. Thin sections (60 nM) were stained with uranyl 
acetate and lead citrate. The samples were observed on a JEOL 1210 electron microscope. 
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[0278] Neither panendocrine markers (chromograiun A, synaptophysin) nor 
specific endocrine markers (glucagon, gastrin, somatostatin, neurotensin, and serotonin) were 
detectable in any regions of Mathl^'°'^~ null mouse intestine (Fig. 17F; cf. wild type. Fig. 
17E). Electron microscopy (EM) on El 8.5 embryos revealed no granular or common goblet 
or enteroendocrine cells in any region of Math! null mouse intestines (Fig. 18B, cf. 
wild type in ISA). Null mouse enterocytes, however, had a normal microvillus brush border: 
strongly positive for alkaline phosphatase and lactase, ample endoplasmic reticulum, a few 
secondary lysosomes, and regular columnar height with uniform nuclei close to the inner 
aspect of the cell (Fig. 18B; Fig. 19). Some mutant enterocytes have abundant glycogen, like 
immature enterocytes, whereas wild-type enterocytes no longer have cytoplasmic clusters. 
Electron microscopy cannot be used to evaluate Paneth cells in Mathl null animals, because 
their characteristic apical granules do not mature until after birth (Stappenbeck and Gordon, 
2000). But cryptdin-1 is one of the earliest markers expressed in Paneth cells, starting at 
E15.5 (Bry et al., 1994), so its expression was examined. 

[0279] RNA was extracted from E18.5 intestine using TRIzol (Gibco BRL) 
according to manufacturer's instructions. cDNA synthesis was performed as described 
(Jensen et al, 1998). Cryptdin and glucose-6-phosphate dehydrogenase (G6PDH) primers 
and PGR were as previously described (Daimoul et al, 1997; Jensen et al, 2000), except for 
the thermal cycle pro file: a single denaturing step at 96 "C for 1 min followed by 25 cycles 

of 96 "C for 30 s; 55 for 30 s;73 °G for 1 min. - 

[0280] Cryptdin-1 consensus primers were used to amplify a 272-bp product 
corresponding to nucleotides 80 to 352 (Darmoul et al, 1997). Cryptdin-1 expression was 
detected in wild-type duodenum, jejunum, and ileum but was completely absent in these three 
regions in Mathl null animals (Fig. 18C). As expected (Bry et al, 1994), no cryptdin- 
positive cells were detected in wild-type or Mathl null colon (Fig. 18C). Neither EM nor 
tunnel assays revealed signs of premature cell death in Mathl null gut (Fig. 18B). 

[0281] In adult crypts, epithelial stem cells and multipotent progenitor cells are 
proliferating and show nuclear staining for Ki-67, a cell proliferation marker (Korinek et al. 
1998) (Fig. 18D and 18E). In the Mathl^"^^^ mice, the lacZ-expressing cells show a 
cytoplasmic blue staining pattern (Ben-Arie et al, 2000). This feature permits colocalization 
of JWarA/Zp-galactosidase and Ki-67. In the crypts, double-positive cells are scattered bom 
the 4th to 13th cell position firom the base of the small intestine and are in the 2nd to 4th 
position in the colon (Fig. 18D and 18E; Fig. 20). Clearly not all Ki-67-positive cells express 
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MathJ, suggesting that iWa/Ai -negative progenitors give rise to the entero-cytes, whereas 
Mathl-expressmg progenitors become goblet, enteroendocrine, and Paneth cells. Upon 
deletion of Mathl, the latter group of cells fail to differentiate, and their progenitors remain in 
the proliferating stage, thus accounting for the intense X-gal staining seen in the crypts of null 
embryos (Fig. 15B and 15D). To ascertain the effects of Mathl deletion on the proliferative 
status of the secretory lineage progenitors, 2500 Ki-67-positive cells were examined in three 
pairs of El 8.5 Mathl null and heterozygous mice for LacZ-positive staining. Double-positive 
cells were scored as a fraction of the total cycling Ki-67-positive population in El 8.5 Mathl 
heterozygous and null mouse intestines. The ratio of double-positive to Ki-67-positive cells 
m Mathl null animals, from duodenum to colon, was roughly three times that seen in 
heterozygotes (25 to 68% versus 7 to 22%). supporting the hypothesis that cells lacking 
Mathl fail to exit the cell cycle and differentiate. Previous studies have shown that members 
of the Notch signaling pathway (e.g., Mashl, Neurogenin 3, and NeuroD) are involved in 
endocrine cell differentiation (Ito et al, 2000; Apelqvist et al, 1999; Rindl et al. 1999). 
Deletion of Hesl, a Notch signaling component that represses bHLH transcriptional 
activators, leads to an increased number of enteroendocrine and goblet cells but fewer 
enterocytes, and elevates expression of Deltal, Delta3, NeuroD, and Mathl in the small 
intestine (Jensen et al, 2000). Hesl also negatively regulates inner ear hair cell 
differentiation by suppressing Mathl (Zheng et al. 2000). These studies support the 
hypothesis that Mathl controls cell fate determination via a Delta-Notch signaling pathway. 
Quantitative reverse transcription-poly-merase chain reaction (RT-PCR) analysis revealed 
that Deltas was reduced to half of wild-type levels in Mathl null mice, and NeuroD 
expression was lost completely (Fig. 21 A). In contrast, Deltal, Hes-1, and Notchl, 2, 3, and 4 
expression levels and cellular localization of Hes-1 appeared unaffected (Fig. 21 A and Fig. 
22). These observations are consistent with previous findings that Mathl is upstream of 
NeuroD (Miyata et al, 1999) and the notion that Mathl has a positive feedback effect on 
Notch ligand {e.g., Delta3) expression. These findings provide new insight into the role of 
Notch-mediated lateral inhibition in controlling differentiation of intestinal epithelial 
lineages. Building on the model set forth by Bjerknes and Cheng (1999), in a specific 
embodiment a single self-maintaining stem cell gives rise to two daughter cells directly or 
through intermediate progenitors (Fig. 21B). In one daughter cell, interaction between Delta 
and Notch homologs elevates Hesl expression, inhibiting Mathl expression, and this cell 
adopts an enterocyte fate. In the other daughter cell, lack of Hesl expression increases Mathl 
expression, and this cell becomes a committed multipotent progenitor that will differentiate 
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into a secretory lineage cell (Fig. 21B). Further differentiation of the secretory lineage into 
goblet, enteroendocrine, and Paneth cells requires other factors. NeuroD has been shown to 
play a role in differentiation of the secretin and cholecystokinin enteroendocrine cells (Rindl 
et al., 1999); early committed multipotential endocrine cells can branch into at least three 
lineages (Fig. 2 IB) (Rindl et al., 1999). Racl is reported to play a positive role in goblet and 
Paneth cell differentiation but does not seem to have any impact on the enteroendocrine 
lineage (Stappenbeck and Gordon, 2000), suggesting that goblet and Paneth cells share a 
closer relationship during later development. Constitutively activated Racl causes precocious 
enterocyte growth, indicating its positive role in the absorptive cell lineage (Stappenbeck and 
Gordon, 2000). These observations suggest that there is cross talk between the Notch and 
Rho GTPase pathways during formation of the gut epithelium. In other specific 
embodiments, instead of arising from one A/afA/-positive progenitor, the goblet, en- 
teroendocrine, and Paneth cells may differentiate from three distinct progenitors that each 
exptsss MathJ . 
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[0283] One skilled in the art readily appreciates that the present invention is well 
adjq)ted to carry out the objectives and obtain the ends and advantages mentioned as well as 
those inherent therein. Sequences, mutations, complexes, methods, treatments, 
pharmaceutical compositions, procedures and techniques described herein are presently 
rq)resentative of the preferred embodiments and are intended to be exemplary and are not 
intended as limitations of the scope. Changes therein and other uses will occur to those 
skilled in the art which are encompassed within the spirit of the invention or defined by the 
scope of the pending claims. 
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What is claimed is: 

1. A method of predicting a differentiation state for a stem cell, comprising the 
steps of: 

obtaining the cell; 

determining the expression status of an ato/ia/-associated sequence. 

2. The method of claim 1 , wherein said stem cell is an intestinal stem cell. 

3. The method of claim 2, wherem the stem cell is obtained from an intestinal 
epithelium. 

4. The method of claim 1, wherein said expression status of said atonal- 
associated sequence is an upregulation of expression of said a/o«a/-associated sequence. 

5. The method of claim 4, wherein said differentiation state is to a secretory cell 
of the intestine. 

6. The method of claim 5, wherein said secretory cell is at least one of a goblet 
cell, an enteroendocrine cell, or a Paneth cell. 

7. The method of claim 1, wherein said expression status of said atonal- 
associated sequence is a downregulation of expression of said a/o/ia/-associated sequence. 

8. The method of claim 7, wherein said differentiation state is to an absorptive 
cell of the intestine. 

9. The method of claim 1, wherein said ato«a/-associated sequence is a 
polynucleotide. 

10. The method of claim 1, wherein said flto/ia/-associated sequence is a 
polypeptide. 

11. A method for differentiating a stem cell, comprising altering expression of an 
ato/2fl/-associated sequence. 

12. The method of claim 1 1, wherein said stem cell is a gastrointestinal stem cell. 

13. The method of claim 11, wherein said stem cell differentiates into a secretory 

cell. 

14. The method of claim 1 3, wherein said secretory cell is at least one of a goblet 
cell, an enteroendocrine cell, or a Paneth cell. 

15. The method of claim 11, wherein said stem cell differentiates into an 
absorptive cell. 
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16. A method of regenerating secretory intestinal cells in an individual, 
comprising the step of administering to the individual a stem cell and a regulatory factor for 
said stem cell, wherein the expression of an a/o/ia/-associated sequence is upregulated in the 
stem cell. 

17. The method of claim 16, wherein the secretory intestinal cell is at least one of 
a goblet cell, an enteroendocrine cell, or a Paneth cell. 

18. The method of claim 16, wherein the regulatory factor is a bone 
morphogenetic protein. 

19. The method of claim 18, wherein the bone morphogenetic protein is GDF7. 

20. A method of regenerating absorptive intestinal cells in an individual, 
comprising the step of administering to the individual a stem cell and a regulatory factor for 
said stem cell, wherein the expression of an ^tto/iaZ-associated sequence is downregulated in 
the stem cell. 

2 1 . The method of claim 20, wherein the regulatory factor is a member of the HES 

family. 

22. The method of claim 21, wherein the HES family member is Hesl, Hes2, 
Hes3, Hes4, Hes5, Hes6, Hes7, HERPl or HERP2. 

23. A method of treating an animal for a gastrointestinal condition, comprising 
delivering to the animal a gastrointestinal stem cell. 

24. - The method of claim 23, wherein the method further comprises delivery of a 
regulatory factor. 

25. A method of treating an animal for a gastrointestinal condition comprising 
delivering a therapeutically effective amount of an atowaZ-associated amino acid sequence or 
nucleic acid sequence to a cell of said animal. 

26. The method of claim 25, wherein said gastrointestinal condition is cancer, 
damaged intestinal tissue, inflammatory bowel disease, irritable bowel syndrome, infection or 
necrotizing entercolitis. 

27. The method of claim 25, wherein said aro/iaZ-associated amino acid sequence 
or nucleic acid sequence is Math J, 

28. The method of claim 25, wherein said afonaZ-associated amino acid sequence 
or nucleic acid sequence is Hathl. 

29. The method of claim 25 wherein said amino acid sequence or nucleic acid 
sequence is administered by a delivery vehicle. 
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30. The method of claim 29 wherein said delivery vehicle is an adenoviral vector, 
a retroviral vector, an adeno-associated viral vector, a plasmid, a liposome, a nucleic acid 
sequence, a peptide, a lipid, a carbohydrate or a combination thereof. 

31. The method of claim 29, wherein said delivery vehicle is selected from the 
group consisting of a viral vector or a non-viral vector. 

32. The method of claim 25, wherein said cell contains an alteration in an atonal- 
associated nucleic acid sequence or amino acid sequence. 

33. The method of claim 32, wherein said amino acid sequence has at least about 
80% identity to about 20 contiguous amino acid residues of SEQ ID NO:58 {Hathl). 

34. The method of claim 32, wherein said nucleic acid sequence encodes a 
polypeptide which has at least about 80% identity to about 20 contiguous amino acid residues 
ofSEQIDNO:58(i/a//2i). 

35. A composition in a pharmaceutical carrier, comprising: 

at least one stem cell, wherein the cell is upregulated for expression of an atonal- 
associated sequence; and 

at least one regulatory factor. 

36. The composition of claim 35, wherein the stem cell is a gastrointestinal stem 

cell. 

37. - A composition in a pharmaceutical carrier, comprising: 

at least one stem cell, wherein the cell is downregulated for expression of an atonal- 
associated sequence; and 

at least one regulatory factor. 

38. The composition of claim 37, wherein the stem cell is a gastrointestinal stem 

cell. 

39. A method of treating an individual for a gastrointestinal condition, comprising 
the step of administering to said individual a composition of claim 36 or claim 38. 

40. A method for screening for a compound in an animal, wherein said compound 
affects a detectable gastrointestinal condition in said animal, comprising: 

delivering said compound to said animal wherein at least one allele of an atonal- 
associated nucleic acid sequence in said animal is inactivated by insertion of a heterologous 
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nucleic acid sequence, wherein said heterologous nucleic acid sequence is under the control 
of an ato/ia/-associated regulatory sequence, and 

monitoring said animal for a change in the detectable gastrointestinal condition. 

41. The method of claim 40, wherein said delivery of said compound affects 
expression of said heterologous nucleic acid sequence, 

42. The method of claim 40 wherein said compound affects said detectable 
condition. 

43. A kit comprising an intestinal stem cell. 

44. The kit of claim 43, further comprising a regulatory protein. 

45. A method of treating an animal for a disease that is a result of loss of 
functional atonal-associated nucleic acid or amino acid sequence comprising delivering a 
therapeutically effective amount of an atonal-associated amino acid sequence or nucleic acid 
sequence to a cell of said animal. 

46. The method of claim 45, wherein said disease is a gastrointestinal disease. 
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SEQUENCE LISTING 

<110>ZOGHBI. HUDAY. 
YANG. Ql 

<120> COMPOSITIONS AND METHODS FOR THE THERAPEUTIC USE OF AN 
ATONAL ASSOCIATED SEQUENCE FOR A GASTROINTESTINAL CONDITION 

<130>P01899WO1 

<140> 10/004.717 
<1 41 > 2001-12-05 

<150> 09/585,645 
<1 51 > 2000-06-01 

<150> 60/176,993 
<1 51 > 2000-01-19 

<150> 60/137,060 
<1 51 > 1999-06-01 

<160>69 

<170>Patentln Ver.2.1 

<210> 1 
<211>1065 
<212> DNA 
<213> Homo sapiens 

<400> 1 

atgtcccgcc tgctgcatgc agaagagtgg gctgaagtga aggagttggg agaccaccat 60 
cgccagcccc agccgcatca tctcccgcaa ccgccgccgc cgccgcagcc acctgcaact 120 
ttgcaggcga gagagcatcc cgtctacccg cctgagctgt ccctcctgga cagcaccgac 180 
ccacgcgcct ggctggctcc cactttgcag ggcatctgca cggcacgcgc cgcccagtat 240 
ttgctacattccccggagct gggtgcctca gaggccgctg cgccccggga cgaggtggac 300 
ggccgggggg agctggtaag gaggagcagc ggcggtgcca gcagcagcaa gagccccggg 360 
ccggtgaaag tgcgggaaca gctgtgcaag ctgaaaggcg gggtggtggt agacgagctg 420 
ggctgcagcc gccaacgggc cccttccagc aaacaggtga atggggtgca gaagcagaga 480 
cggctagcag ccaacgccag ggagcggcgc aggatgcatg ggctgaacca cgccttcgac 540 
cagctgcgca atgttatccc gtcgttcaac aacgacaaga agctgtccaa atatgagacc 600 
ctgcagatgg cccaaatcta catcaacgcc ttgtccgagc tgctacaaac gcccagcgga 660 
ggggaacagc caccgccgcc tccagcctcc tgcaaaagcg accaccacca ccttcgcacc 720 
gcggcctcct atgaaggggg cgcgggcaac gcgaccgcag ctggggctca gcaggcttcc 780 
ggagggagcc agcggccgac cocgcccggg agttgccgga ctcgcttctc agccccagct 840 
tctgcgggag ggtactcggt gcagctggac gctctgcact tctcgacttt cgaggacagc 900 
gccctgacag cgatgatggc gcaaaagaat ttgtctcctt ctctccccgg gagcatcttg 960 
cagccagtgc aggaggaaaa cagcaaaact tcgcctcggt cccacagaag cgacggggaa 1020 
ttttcccccc attcccatta cagtgactcg gatgaggcaa gttag 1 065 



<210>2 
<211>354 
<212> PRT 
<213> Homo sapiens 

<400>2 

Met Ser Arg Leu Leu His Ala Glu Glu Trp Ala Glu Val Lys Glu Leu 
15 10 15 
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Gly Asp His His Arg Gin Pro Gin Pro His His Leu Pro Gin Pro Pro 
20 25 30 

Pro Pro Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Glu His Pro Val 
35 40 45 

Tyr Pro Pro Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp 
50 55 60 

Leu Ala Pro Thr Leu Gin Gly lie Cys Thr Ala Arg Ala Ala Gin Tyr 
65 70 75 80 

Leu Leu His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg 
85 90 95 

Asp Glu Val Asp Gly Arg Gly Glu Leu Val Arg Arg Ser Ser Gly Gly 
100 105 110 

Ala Ser Ser Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu 
115 120 125 

Cys Lys Leu Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg 
130 135 140 

Gin Arg Ala Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg 
145 150 155 160 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
165 170 175 

His Ala Phe Asp Gin Leu Arg Asn Val tie Pro Ser Phe Asn Asn Asp 
180 185 190 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie 
195 200 205 

Asn Ala Leu Ser Glu Leu Leu Gin Thr Pro Ser Gly Gly Glu Gin Pro 
210 215 220 

Pro Pro Pro Pro Ala Ser Cys Lys Ser Asp His His His Leu Arg Thr 
225 230 235 240 

Ala Ala Ser Tyr Glu Gly Qty Ala Gly Asn Ala Thr Ala Ala Gly Ala 
245 250 255 

Gin GIr. Ala Ser Gly Gly Ser Gin Arg Pro Thr Pro Pro Gly Ser Cys 
260 265 270 

Arg Thr Arg Phe Ser Ala Pro Ala Ser Ala Gly Gly Tyr Ser Val Gin 
275 280 285 

Leu Asp Ala Leu His Phe Ser Thr Phe Glu Asp Ser Ala Leu Thr Ala 
290 295 300 

Met Met Ala Gin Lys Asn Leu Ser Pro Ser Leu Pro Gly Ser lie Leu 
305 310 315 320 

Gin Pro Val Gin Glu Glu Asn Ser Lys Thr Ser Pro Arg Ser His Arg 
325 330 335 
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Ser Asp Gly Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu 
340 345 350 

Ala Ser 



<210>3 
<211>420 
<212> DNA 
<213> Mus musculus 

<400> 3 

tcgtcccacg cgtccgcgcc aagcccgcgg cgcggaggac accgtgctcg gttccgggct 60 
gcggggacat tcccggacac acaccggagc agcagctgcg ccgcgacaca tctggagccg 120 
cgtaggatgt tcgtcaaatc tgagactctg gagttgaagg aggaagagga ggtactgatg 180 
ctgctgggct cggcttcccc ggcctcggcg accctgaccc cgatgtcctc cagcgcggac 240 
gaggaggagg acgaggagct gcgccggccg ggctccgcgc gtgggcagcg tggagcggaa 300 
gccgggcagg gggtgcaggg cagtccggcg tcgggtgcct ggggttgccg gacagggcgg 360 
ctgctatgca ctgtgcacga gtgctagcgt gtgccgtcgc gctcacgggc cgtctgcaga 420 



<210>4 
<211>645 
<212> DNA 
<213> Mus musculus 

<400>4 

atggcgcctc atcccttgga tgcgctcacc atccaagtgt ccccagagac acaacaacct 60 
tttcccggag cctcggacca cgaagtgctc agttccaatt ccaccccacc tagccccact 120 
ctcataccta gggactgctc cgaagcagaa gtgggtgact gccgagggac ctcgaggaag 180 
ctccgcgccc gacgcggagg gcgcaacagg cccaagagcg agttggcact cagcaaacag 240 
cgaagaagcc ggcgcaagaa ggccaatgat cgggagcgca atcgcatgca caacctcaac 300 
tcggcgctgg atgcgctgcg cggtgtcctg cccaccttcc cggatgacgc caaacttaca 360 
aagatcgaga ccctgcgctt cgcccacaac tacatctggg cactgactca gacgctgcgc 420 
atagcggacc acagcttcta tggcccggag ccccctgtgc cctgtggaga gctggggagc 480 
cccggaggtg gctccaacgg ggactggggc tctatctact ccccagtctc ccaagcgggt 540 
aacctgagcc ccacggcctc attggaggaa ttccctggcc tgcaggtgcc cagctcccca 600 
tcctatctgc tcccgggagc actggtgttc tcagacttct tgtga 645 



<210>5 
<211>214 
<212> PRT 
<213> Mus musculus 

<400>5 

Met Ala Pro His Pro Leu Asp Ala Leu Thr lie Gin Val Ser Pro Glu 
1 5 10 15 

Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 
20 25 30 

Asn Ser Thr Pro Pro Ser Pro Thr Leu lie Pro Arg Asp Cys Ser Glu 
35 40 45 

Ala Glu Val Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 
50 55 60 
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Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 

Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 
85 90 95 

His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 
130 135 140 

Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 
145 150 155 160 

Pro Gly Gly Gly Ser Asn Gly Asp Trp Gly Ser lie Tyr Ser Pro Val 
165 170 175 

Ser Gin Ala Gly Asn Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 
180 185 190 

Gly Leu Gin Val Pro Ser Ser Pro Ser Tyr Leu Leu Pro Gly Ala Leu 
195 200 205 

Val Phe Ser Asp Phe Leu 
210 



<210>6 
<211> 1412 
<212> DNA 
<213> Mus musculus 

<400> 6 

cgtgctcggt tccgggctgc ggggacattc ccggacacac accggagcag cagctgcgcc 60 
gcgacacatc tggagccgcg taggatgttc gtcaaatctg agactctgga gttgaaggag 120 
gaagaggagg tactgatgct gctgggctcg gcttccccgg cctcggcgac cctgaccccg 180 
atgtcctcca gcgcggacga ggaggaggac gaggagctgc gccggccggg ctccgcgcgt 240 
gggcagcgtg gagcggaagc cgggcagggg gtgcagggca gtccggcgtc gggtgccggg 300 
ggttgccggc cagggcggct gctgggcctg atgcacgagt gcaagcgtcg cccgtcgcgc 360 
tcacgggccg tctcccgagg tgccaagacg gcggagacgg tgcagcgcat caagaagacc 420 
cgcaggctca aggccaacaa ccgcgagcgc aaccgcatgc acaacctaaa cgccgcgctg 480 
gacgcgctgc gcgaggtgct gcccaccttc cccgaggatg ccaagctcac gaagatcgag 540 
acgctgcgct tcgcccacaa ttacatctgg gcgctcaccg agactctgcg cctggcggac 600 
cactgcgccg gcgccggtgg cctccagggg gcgctcttca cggaggcggt gctcctgagc 660 
ccgggagctg cgctcggcgc cagcggggac agcccttctc caccttcctc ctggagctgc 720 
accaacagcc cggcgtcatc ctccaactcc acgtccccat acagctgcac tttatcgccc 780 
gctagccccg ggtcagacgt ggactactgg cagcccccac ctccggagaa gcatcgttat 840 
gcgcctcacc tgcccctcgc cagggactgt atctagagct gcgggtctcc ctctctcgtc 900 
ctctacccgg ccctcttccc atccttctcc cgcccctcac cctccacgcc ccggactcca 960 
cttcacagag cagaggtggc ccttgcaatc ccctcggcgg ctggtgcatt cgggggtgga 1020 
gaccagctct ggtttattga agatgtgagg atttatggtc aaagaggact atggcgtgtg 1080 
ggagtggggg ctggcgtggg gaacctcgta agactgtaaa agacactgag aaaaagtacc 1 140 
ataactaacg agtgtgcaga gcagactgac gctcctcccc tctctcagag ctgctggagg 1200 
agaactccgg gcaggcagtt cgtgtgaatc tctcagaggg aatgcaactg gtccctgtga 1260 
tcttttcacc ttcgtttcta catagagatg ttaatgtcag tcgaaagaaa tgtattttag 1320 
catctgaatg aatttactgg taataatatt atccacacat ttgcaatggc tggcatctgc 1380 
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tctattccca ttgctgtctg caggctgtgg ga 1412 



<210> 7 

<211>263 

<212>PRT 

<213> Mus musculus 

<400> 7 

Met Phe Val Lys Ser Glu Thr Leu Glu Leu Lys Glu Glu Glu Glu Val 
1 5 10 15 

Leu Met Leu Leu Gly Ser Ala Ser Pro Ala Ser Ala Thr Leu Thr Pro 
20 25 30 

Met Ser Ser Ser Ala Asp Glu Glu Glu Asp Glu Glu Leu Arg Arg Pro 
35 40 45 

Gly Ser Ala Arg Gly Gin Arg Gly Ala Glu Ala Gly Gin Gly Val Gin 
50 55 60 

Gly Ser Pro Ala Ser Gly Ala Gly Gly Cys Arg Pro Gly Arg Leu Leu 
65 70 75 80 

Gly Leu Met His Glu Cys Lys Arg Arg Pro Ser Arg Ser Arg Ala Val 
85 90 95 

Ser Arg Gly Ala Lys Thr Ala Glu Thr Val Gin Arg lie Lys Lys Thr 
100 105 110 

Arg Arg Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu 
115 120 125 

Asn Ala Ala Leu Asp Ala Leu Arg Glu Val Leu Pro Thr Phe Pro Glu 
130 135 140 

Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala His Asn Tyr 
145 150 155 160 

lie Trp Ala Leu Thr Glu Thr Leu Arg Leu Ala Asp His Cys Ala Gly 
165 170 175 

Ala Gly Gly Leu Gin Gly Ala Leu Phe Thr Glu Ala Val Leu Leu Ser 
180 185 190 

Pro Gly Ala Ala Leu Gly Ala Ser Gly Asp Ser Pro Ser Pro Pro Ser 
195 200 205 

Ser Trp Ser Cys Thr Asn Ser Pro Ala Ser Ser Ser Asn Ser Thr Ser 
210 215 220 

Pro Tyr Ser Cys Thr Leu Ser Pro Ala Ser Pro Gly Ser Asp Val Asp 
225 230 235 240 

Tyr Trp Gin Pro Pro Pro Pro Glu Lys His Arg Tyr Ala Pro His Leu 
245 250 255 

Pro Leu Ala Arg Asp Cys lie 
260 
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<210>8 
<211>1957 
<212>DNA 
<213>Musmusculus 

<400> 8 

gatacacaca gatctagagg ctccaggaga cgatgcgaca ctcagcctga aaagatttgg 60 
aagatccaaa atgaaaactg attattgaat gaaattaaaa cctaaggtaa attaaggtta 120 
aagaaccatg ttaacactac cgtttgacga gtctgtcgta atgcccgaat cccagatgtg 180 
cagaaagttt gctagacaat gtgaggacca gaaacaaatt aagaaaccag agagctttcc 240 
aaaacaagtt gtccttcgag gaaagagcat taaaagggcc cctggagaag aaaccgagaa 300 
agaagaggag gaagaagaca gagaggaaga agatgagaat ggcttgtcca gaaggagggg 360 
gctcaggaaa aaaaagacca ccaaactacg actggaaagg gtcaagttca ggagacagga 420 
agctaatgcg cgcgagagga accggatgca cggcctcaat gatgctctgg acaatttgcg 480 
aaaagtggtc ccctgttact ctaaaaccca aaaactgtcc aaaatagaaa ctttacgact 540 
ggccaaaaat tacatctggg cactttctga aattctgagg attggcaaga gaccggatct 600 
gctcacgttc gtccaaaact tatgcaaagg tctttcccag ccaactacaa acttggtggc 660 
aggctgctta cagctcaacg ccagaagttt cctgatgggt cagggtgggg aggctgccca 720 
ccacacaagg tcaccctact ccacattcta cccaccctac cacagccctg agctggccac 780 
tcccccaggg catgggactc ttgataattc caagtccatg aaaccctaca attactgcag 840 
tgcatatgaa tccttctatg aaagtacctc ccctgagtgt gccagccctc agtttgaagg 900 
tcccttaagt cctcccccaa ttaactataa tgggatattt tccctgaagc aagaagaaac 960 
cttggactat ggcaaaaatt acaattatgg catgcattac tgtgcagtgc cacccagggg 1020 
tccccttggg cagggtgcca tgttcaggtt gcccaccgac agccacttcc cttacgactt 1080 
acatctgcgc agccaatctc tcactatgca agatgaatta aatgcagttt ttcataatta 1 140 
atgaggaaaa tgaaaataaa cagtggtcat tcacctccca ctctaattaa ggcaaagcag 1200 
atgcttgtgg gctgcgtaat tggcacaact ctatctaagg tgtttactag tttctgaagt 1260 
gtgtctcaaa gattgtgacc attttctatg tcataataaa tcccttttcg tatgagaact 1320 
tcctttcctt ccctcttgtc tgtatcacac tgtgattctc tctctctctc tctctctctc 1380 
tctctctctc tctctctctt actggcagaa tatttctttc ttgttttagt ttctttcaaa 1440 
ttcacttaat ttgtttgaac aaggtgtcta agatgttgct gaataaagac atgcacacag 1 500 
catacttcaa tgtctatttc agttgtacag ttatgatgaa aatgcatgtt ataaaaatca 1560 
gatgagtaaa atgtgtttat aattactagg attcatatat gtatctctga aattttagtt 1620 
vltaaaatat taagagctaa ccatgaaatt aaaaggtgca tttggggatg cacaacggta 1680 
tcaaaagcta tgcaattttc tgtttattag ggacaaaaat aagtgtattc agttggtaac 1740 
aactattcct cttcaagcat tttcagagga ggaaacacgg tatttggggg aggttatcag 1800 
tgtcataatt tggggacaat taattcaatc atgaagaaaa aaaatattag cacttgtttt 1860 
gtattgttca ggatttttct gtacaggttt gttacagtgt ataattgtgt tttocatcct 1 920 
acagtttaaa agcaattaaa catagatatt tccactt 1 957 



<210>9 
<211>337 
<212> PRT 
<213> Mus musculus 

<400>9 

Met Leu Thr Leu Pro Phe Asp Glu Ser Val Val Met Pro Glu Ser Gin 
15 10 15 

Met Cys Arg Lys Phe Ala Arg Gin Cys Glu Asp Gin Lys Gin lie Lys 
20 25 30 

Lys Pro Glu Ser Phe Pro Lys Gin Val Val Leu Arg Gly Lys Ser lie 
35 40 45 

Lys Arg Ala Pro Gly Glu Glu Thr Glu Lys Glu Glu Glu Glu Glu Asp 
50 55 60 
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Arg Glu Glu Glu Asp Glu Asn Gly Leu Ser Arg Arg Arg Gly Leu Arg 
65 70 75 80 

Lys Lys Lys Thr Thr Lys Leu Arg Leu Glu Arg Val Lys Phe Arg Arg 
85 90 95 

Gin Glu Ala Asn Ala Arg Glu Arg Asn Arg Met His Gly Leu Asn Asp 
100 105 110 

Ala Leu Asp Asn Leu Arg Lys Vai Val Pro Cys Tyr Ser Lys Thr Gin 
115 120 125 

Lys Leu Ser Lys lie Glu Thr Leu Arg Leu Ala Lys Asn Tyr lie Trp 
130 135 140 

Ala Leu Ser Glu lie Leu Arg He Gly Lys Arg Pro Asp Leu Leu Thr 
145 150 155 160 

Phe Val Gin Asn Leu Cys Lys Gly Leu Ser Gin Pro Thr Thr Asn Leu 
165 170 175 

Val Ala Gly Cys Leu Gin Leu Asn Ala Arg Ser Phe Leu Met Gly Gin 
180 185 190 

Gly Gly Glu Ala Ala His His Thr Arg Ser Pro Tyr Ser Thr Phe Tyr 
195 200 205 

Pro Pro Tyr His Ser Pro Glu Leu Ala Thr Pro Pro Gly His Gly Thr 
210 215 220 

Leu Asp Asn Ser Lys Ser Met Lys Pro Tyr Asn Tyr Cys Ser Ala Tyr 
225 230 235 240 

Glu Ser Phe Tyr Glu Ser Thr Ser Pro Glu Cys Ala Ser Pro Gin Phe 
245 250 255 

Glu Gly Pro Leu Ser Pro Pro Pro lie Asn Tyr Asn Gly He Phe Ser 
260 265 270 

Leu Lys Gin Glu Glu Thr Leu Asp Tyr Gly Lys Asn Tyr Asn Tyr Gly 
275 280 285 

Met His Tyr Cys Ala Val Pro Pro Arg Gly Pro Leu Gly Gin Gly Ala 
290 295 300 

Met Phe Arg Leu Pro Thr Asp Ser His Phe Pro Tyr Asp Leu His Leu 
305 310 315 320 

firg Ser Gin Ser Leu Thr Met Gin Asp Glu Leu Asn Ala Val Phe His 
325 330 335 

Asn 



<210> 10 
<211>1056 
<212> DNA 
<213> Mus musculus 
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<400> 10 

atgtcccgcc tgctgcatgc agaagagtgg gctgaggtaa aagagttggg ggaccaccat 60 
cgccatcccc agccgcacca cgtcccgccg ctgacgccac agccacctgc taccctgcag 120 
gcgagagacc ttcccgtcta cccggcagaa ctgtccctcc tggatagcac cgacccacgc 180 
gcctggctga ctcccacttt gcagggcctc tgcacggcac gcgccgccca gtatctgctg 240 
cattctcccg agctgggtgc ctccgaggcc gcggcgcccc gggacgaggc tgacagccag 300 
ggtgagctgg taaggagaag cggctgtggc ggcctcagca agagccccgg gcccgtcaaa 360 
gtacgggaac agctgtgcaa gctgaagggt ggggttgtag tggacgagct tggctgcagc 420 
cgccagcgag ccccttccag caaacaggtg aatggggtac agaagcaaag gaggctggca 480 
gcaaacgcaa gggaacggcg caggatgcac gggctgaacc acgccttcga ccagctgcgc 540 
aacgttatcc cgtccttcaa caacgacaag aagctgtcca aatatgagac cctacagatg 600 
gcccagatct acatcaacgc tctgtcggag ttgctgcaga ctcccaatgt cggagagcaa 660 
ccgccgccgc ccacagcttc ctgcaaaaat gaccaccatc acctlcgcac cgcctcctcc 720 
tatgaaggag gtgcgggcgc ctctgcggta gctggggctc agccagcccc gggagggggc 780 
ccgagaccta ccccgcccgg gccttgccgg actcgcttct caggcccagc ttcctctggg 840 
ggttactcgg tgcagctgga cgctttgcac ttcccagcct tcgaggacag ggccctaaca 900 
gcgatgatgg cacagaagga cctgtcgcct tcgctgcccg ggggcatcct gcagcctgta 960 
caggaggaca acagcaaaac atctcccaga tcccacagaa gtgacggaga gttttccccc 1020 
cactctcatt acagtgactc tgatgaggcc agttag 1 056 



<210>11 

<211>351 

<212> PRT 

<213> Mus musculus 

<400>11 

Met Ser Arg Leu Leu His Ala Glu Glu Trp Ala Glu Val Lys Glu Leu 
1 5 10 15 

Gly Asp His His Arg His Pro Gin Pro His His Val Pro Pro Leu Thr 
20 25 30 

Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Asp Leu Pro Val Tyr Pro 
35 40 45 

Ala Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp Leu Thr 
50 55 60 

Pro Thr Leu Gin Gly Leu Cys Thr Ala Arg Ala Ala Gin Tyr Leu Leu 
65 70 75 80 

His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg Asp Glu 
85 90 95 

Ala Asp Ser Gin Gly Glu Leu Val Arg Arg Ser Gly Cys Gly Gly Leu 
100 105 110 

Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu Cys Lys Leu 
115 120 125 

Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg Gin Arg Ala 
130 135 140 

Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg Arg Leu Ala 
145 150 155 160 

Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn His Ala Phe 
165 170 175 
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Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp Lys Lys Leu 
180 185 190 

Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie Asn Ala Leu 
195 200 205 

Ser Glu Leu Leu Gin Thr Pro Asn Val Gly Glu Gin Pro Pro Pro Pro 
210 215 220 

Thr Ala Ser Cys Lys Asn Asp His His His Leu Arg Thr Ala Ser Ser 
225 230 235 240 

Tyr Glu Gly Gly Ala Gly Ala Ser Ala Val Ala Gly Ala Gin Pro Ala 
245 250 255 

Pro Gly Gly Gly Pro Arg Pro Thr Pro Pro Gly Pro Cys Arg Thr Arg 
260 265 270 

Phe Ser Gly Pro Ala Ser Ser Gly Gly Tyr Ser Val Gin Leu Asp Ala 
275 280 285 

Leu His Phe Pro Ala Phe Glu Asp Arg Ala Leu Thr Ala Met Met Ala 
290 295 300 

Gin Lys Asp Leu Ser Pro Ser Leu Pro Gly Gly He Leu Gin Pro Val 
305 310 315 320 

Gin Glu Asp Asn Ser Lys Thr Ser Pro Arg Ser His Arg Ser Asp Gly 
325 330 335 

Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu Ala Ser 
340 345 350 



<210> 12 
<211>3261 
<212> DNA 
<213> Mus musculus 

<400>12 

aatctataca tggcaaaaat gtatatgaaa tccaaggaca tggtggagct ggtcaacaca 60 
caatcctgga tggacaaagg tctgagctct caaaatgaga tgaaggagca agagagaaga 120 
ccgggctctt atggaatgct cggaacctta actgaagagc atgacagtat tgaggaggat 180 
gaagaagagg aagaagatgg agataaacct aaaagaagag gtcccaagaa aaagaagatg 240 
actaaagctc gccttgaaag attcagggct cgaagagtca aggccaatgc tagagaacgg 300 
acccggatgc atggcctgaa tgatgccttg gataatctta ggagagtcat gccatgttac 360 
tctaaaactc aaaagctttc caagatagag actcttcgac tggcaaggaa ctacatctgg 420 
gccttgtctg aagtcctgga gactggtcag acacttgaag ggaagggatt tgtagagatg 480 
ctatgtaaag gtctctctca acccacaagc aacctggttg ctggatgcct ccaactgggg 540 
cctcaatcta ccctcctgga gaagcatgag gaaaaatctt caatttgtga ctctactatc 600 
tctgtccaca gcttcaacta tcagtctcca gggctcccca gccctcctta tggccatatg 660 
gaaacacatt ctctccatct caagcctcaa ccatttaaga gtttgggtga ctcttttggg 720 
agccatccac ctgactgcag taccccccct tatgagggtc cactcacacc acccctgagc 780 
attagtggca acttctcctt aaagcaagac ggctcccctg atttggaaaa atcctacaat 840 
ttcatgccac attatacctc tgcaagtcta agttcagggc atgtgcattc aactcccttt 900 
cagactggca ctccccgcta tgatgttcct gtagacctga gctatgattc ctactcccac 960 
catagcattg gaactcagct caatacgatc ttctctgatt agagcaataa gataagcacc 1020 
aatatttcag agaatgaagt ggagattttt ttcacatttc tagtggctga gctaaactct 1080 
cagaaaattt aaaagaacct ttggatatgc atcaaacata atagtcctag tttgttcaga 1 140 
acttcctgta cctgctaact ttcttcccat taacttctca cattggacca gtcctacatt 1200 
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tggtaaactt aagtgaatat atttgatggt ttgaggccac atggtaatag aacagaaaga 1260 
aagcccaggc cctgttccaa tggtgccaaa gattaattga atgctctgcc aattaacttt 1320 
ccatttccag tgtttttatt gctttctgat aaacatgaag caactgttcc aaatcaacat 1380 
atacctttca cctcccacac atttttaaat ttaaaagcaa tcaaagcaaa tagcaaaaat 1440 
ggaatgatta tacagaactg gaaggagcat caagtacatg tctgttggct tatagaatac 1500 
aaaatttgtg tgaatttgac aaatcatctt tgtgtctatt taaatataac ttccagagcc 1560 
taaaataata acttggatgt taccataaat aaccagtatg ttctttaaga gatcagctct 1620 
acttattact gtgctgaaag gtatacacac cttttttagt gattggagaa ccatgataga 1680 
agcctcacac aaactttatt ctttatacta tttaaaaaac aactgtctta gtactaggag 1740 
acaagcaaga aagacattga aatttctctt ctggcacaca gaaatatcac ctagctcatt 1800 
tcttagctcc cgtgactata gggctgtaga attttgcagg tattcaggtg cttcagttag 1860 
aatcagaact cagccaggtt catactgtag agcaataaaa tggtggttgg ctgctatccc 1920 
aactaacaca gttaaagaac tctgcctcac acatagccac tggaaaaatg tggatattct 1980 
ccagcaagat gaatttcact gtttaaagca atgcaattaa agccatagag tttcgtccac 2040 
tccagtatca tatattccag aactgttgta atcaatcttg aattcttaca acataaatgc 2100 
aactccttac ttcccaccta acactgattg ttatattgtc ttcaattcca agatattatg 2160 
oaattatatg caaaattttg attagaatca aaattaagag tcaatgaatc tgtctgtatc 2220 
ttcaggacgg gttttgatca gttttaagaa agtttatttt cctttatgtg gcatctcttt 2280 
tctttgtaac cacactggtt cagccaagtt tctcttctcc agagaaatta gctctgagaa 2340 
attttactat catgatccat cttccacagc aattatttag gttcaactca agagtataca 2400 
tagtttattt atagtgggtg aggatacacc tccaagaata aattttaaca acattaatga 2460 
catatgaata tgccatttta tctaccaaac tatatatgta tctcttttct ttattgccct 2520 
tatttatttc ttcacactga gagttatttt gtgtccatct tattgcagca cttactctgc 2580 
tctactttgc acctttggat tataaatatg tttaaaagtc tgtaaagacg tcttaaacaa 2640 
ctcgtgacag taattcacca cccctaagac cttgaatcac cctagtggaa ataggcaagg 2700 
agaattattt atagaatcat cctatgtaat tttttttgag aatttgctct acctagcatt 2760 
• tatgtttata gataattgct atctgcatta tttattaggt tctatttatt taatttatet 2820 
ttctttcttt ttatgtaaac atttgtgccc catagatata gcctcaaagc ttcactggga 2880 
aactagctta tatgtttgga gtgagagaaa aggagaaaat cagttcttga ttgcttgcaa 2940 
tggttttata aaacagagca ataatttgaa tagatatgca acttaatggt ttlagaattt 3000 
ttcctttaag gtgcaacaga gttacattat tatttatgac tttggagaat gtagtacatg 3060 
tgaaccagga ctgtaggctt gtgaagagag attttataat taaatacaaa tttagtactg 3120 
tactatgctt ggaaagaact tgttctttaa ataatgttta gtcttctggg agtgttttca 3180 
gataaaatga agcaattgtt taaaagaaat ggctgttttc cctccctttt ccagtagcaa 3240 
taaagctttg agtgttatta c 3261 



<210> 13 
<211>501 
<212> DNA 
<213> Zebra Fish 

<220> 

<221> modified base 
<222>(115) 

<223> N = any nucleotide 
<400>13 

atgttaacgg taccgtttga agagccagat atgatgcgcg agtctcagtt Iggcgccacg 60 
ttcacgcgtc aggaagacgt ccggacactc agcagcgccg agctcaagga ggcanaggac 120 
gacaacacgg acagggagga ggaggaggag agagaggagg actataacgg gctgccaaag 180 
aagaagggtc cccgcaaaaa gaaatccgag ggacgcggtg accgagtcaa aatgcgccgt 240 
caggaagcaa acgcgcgtga gcgcagccgc atgcacggtc taaacgacgc gctcgaaagc 300 
ctgcgcaaag tcgtgccgtg ctactccaaa acgcagaaac tctccaagat cgaaaccctg 360 
gggctgggca agaattacat ttgggctctg tctgagactt tgagcgcagg aaagcgacct 420 
ggcctgcttg ggttggtagc aacx:ctgggc gtgggctggt ctagaggaca gacx:agcttg 480 
gtgggogaot gcctgcagct a 501 



<210> 14 
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<211>609 

<212> DNA 

<213> Homo sapiens 

<400> 14 

tttgacattg aattatgctg tgtgcatgtc tttattcaac agtatattct tagacacctt 60 
gttcaaacaa attaaataag tttaaaagaa aattaaaaag aaaaaaatct ttccagtaga 120 
aacagaatca cagtgcttca cagacaaaag gaaaggaaaa gaagttctca tacgaaaaga 180 
gatttattat tacatagaaa attctcacaa tagttgaaac acacttcaga aactagtaaa 240 
caccttagat agagttgtgc caattactca gcccacaagc atctgctttg tcttaattag 300 
acaggggagg tgaatgacca ctgtttattt tcattttcct cattaattat gaaaaactgc 360 
atttaattca tcttgcattg tgagagattg gctgcgcaga tgtaagtcgt aagggaagtg 420 
gctgtcggtg ggcaacctga acatggcacc ctgcccaagg ggacccctgg gtggcactgc 480 
acagtaatgc atgccgtaat tgtaattttt accatagtcc aaggtttctt cttgcttcag 540 
ggaaaatatc ccattatagg taattggggg aggacttaag ggaccttcaa actgagggct 600 
ggcacactc 609 



<210>15 
<211>675 
<212> DNA 
<213> Homo sapiens 

<400>15 

tagagagcgg caggagacga tgcgacactc agcctgaaaa gatttggaag acccaaaatg 60 
aaaactgatt attgaatgaa attaaaacct aaggtaattt aagattagag aaccatgtta 120 
acactaccgt ttgatgagtc tgttgtaatg ccagaatccc agatgtgcag aaagttttct 180 
agagaatgcg aggaccagaa gcaaattaag aagccagaaa gcttttccaa acagattgtc 240 
cttcgaggaa agagcatcaa aagggcccct ggagaagaaa ccgagaaaga agaagaggag 300 
gaagacaggg aagaggaaga tgaaaatggg ttgcctagaa ggaggggtct taggaaaaaa 360 
aagacaacaa agctgcgatt ggaaagggtc aagttcagga gacaggaagc gaacgcgcgc 420 
gagaggaaca ggatgcacgg cctcaacgac gctctggaca acttaagaaa agtggtccgc 480 
tgttattcta tgacccagag actgtccaaa atagagactg tactactggc caaaaactac 540 
atctgggcac tgtctgatat gctgagaatc ggcaagagac cagatctgct cacgattcgg 600 
caaagctgat gcatagggtc ttgccagcca actacagact tggtggcagg ctggtcgcag 660 
ctcaacgaca ggagt 675 



<210>16 
<211> 1476 
<212> DNA 
<213> brosophila 

<220> 

<221> modified_base 
<222> (203)..(217) 
<223> N = any nucleotide 

<400> 16 

ctccaccctc gtgtgccgtt gagcgtgaaa gtcacaaatt atgggcgaca gagggagagg 60 
gagagagaga gaggggaaga tcccatggca tgatagtacc aatttggcaa gatatgacgc 120 
gtgcdggcg cgagagagat gacaacaggc gaagtgtagg cgtttcacca ccgagcgaaa 180 
gggagggaaa catatctaac ccntaagtca ccccagntcc tcggggagat cctttggccg 240 
ggccatggta atagctgcga ggatcctggg gccaggggtc atttgcgggc catttacaca 300 
aacttggtca gaacggcggt gaaaatattt gtagaatgca tctcgcgggg gttgaccgta 360 
gtcaagtctg gatccgatcg gatcgtttca gttgcaacga aactttcaag ccgcgcggat 420 
ctgcaatgag ctactactac tcgtctgcct ccgaggagga tggcagttcc cagtatctgg 480 
gcagtcccaa ctacaacttg acccagttgc cgccagtttc tggccaggat tacggacagg 540 
gggctttctt atcgccggaa tggcaattct tggatgccgc aggcggaact caaacggaac 600 
taggacccat aatggaggtg caaggacagc acacccagcc gcagaccaaa cggcggagta 660 
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acagctccac aggatcggac ggtaggaaga gcagtccaga gcagaccaat ctcagtccca 720 
cggtccagaa gagaagacga caggctgcca atgcgcggga aaggaagcgg atgaatggat 780 
taaatgcggc tttcgagcgc ctaagggaag tggtgcccgc tccgtccatt gaccagaaat 840 
tgtccaagtt cgagactctc cagatggccc agtcctacat cctggcgctg tgcgatctcc 900 
tcaacaacgg ggacgtggaa gtggatgccg ctgcatacac catcttcggc gacagcgata 960 
gtggatttgg attgagcgga ggatccttgt catagatgga tgatactaga ctaaagttal 1020 
gtgattttct tcttactgta gattaagtta aatatgtaat gaaataaatt gaaatgttta 1080 
^ttgaatgct aaaaataatt tcaatttcaa aggcattatt catggaacgc atcgcttgat 1 140 
tgtgaaccaa gtgttcaatc caaaatgagc tactcatcta accctttcta ccaattataa 1200 
ccagcccact actatacaca gatccccaga ttcaattggc catcagatcg tttggctgcg 1260 
cctcaatgat ttcgaaaacg atcataaatt aattactcaa cagttcattg ataggcgtac 1320 
ttcaagtttt grtctaaccc cgaacgattc cgaaactgcg aagccaccac agtgggcggt 1380 
gtagcgtgac acccctgaac tcctggccag aactcctctc attgaataaa aaaggcagct 1440 
gagtccttcg gcttcggttg ggtgctgttt tttttt 1476 



<210> 17 
<211>189 
<212> PRT 
<213> Drosophila 

<400>17 

Met Ser Tyr Tyr Tyr Ser Ser Ala Ser Glu Glu Asp Gly Ser Ser Gin 
1 5 10 15 

Tyr Leu Gly Ser Pro Asn Tyr Asn Leu Thr Gin Leu Pro Pro Val Ser 
20 25 30 

Gly Gin Asp Tyr Gly Gin Gly Ala Phe Leu Ser Pro Glu Trp Gin Phe 
35 40 45 

L.eu Asp Ala Ala Gly Gly Thr Gin Thr Glu Leu Gly Pro lie Met Glu 
50 55 60 

Val Gin Gly Gin His Thr Gin Pro Gin Thr Lys Arg Arg Ser Asn Ser 
65 70 75 80 

Ser Thr Gly Ser Asp Gly Arg Lys Ser Ser Pro Glu Gin Thr Asn Leu 
85 90 95 

Ser Pro Thr Val Gin Lys Arg Arg Arg Gin Ala Ala Asn Ala Arg Glu 
100 105 110 

Arg Lys Arg Met Asn Gly Leu Asn Ala Ala Phe Glu Arg Leu Arg Glu 
115 120 125 

Val Val Pro Ala Pro Ser lie Asp Gin Lys Leu Ser Lys Phe Glu Thr 
130 135 140 

Leu Gin Met Ala Gin Ser Tyr lie Leu Ala Leu Cys Asp Leu Leu Asn 
145 150 155 160 

Asn Gly Asp Val Glu Val Asp Ala Ala Ala Tyr Thr lie Phe Gly Asp 
165 170 175 

Ser Asp Ser Gly Phe Gly Leu Ser Gly Gly Ser Leu Ser 
180 185 



<210> 18 
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<211>1074 
<212> DNA 
<213> chicken 

<400> 18 

tttttccttt cccttttcct cgcggagccg cggccgcttt gccttcccct acagcgagga 60 
gctccgcttt ctcctctatt tatttctaaa ccgttattgc ttaaccgcag ccccgggagt 120 
gccttlccat gtgagtacgg gggacacagg cacctgcgat gcggtgcggc cgtgccaagc 180 
actcccaccc tccctccgtc cctccttggg gttactttgg gatatttttt ccctcccttt 240 
ccccctcccc tcagcacctt ccccccgctc cccctgccgc ccactgacgg ctccggcttc 300 
tctcccgcag gatgcccgcg gaggcggcga gcagcggcgg cgtttcggag ccgcccggag 360 
ctccgcggga gcggcggagg agacgcggcc gtgcgcgggc gcggaccgag gctttgctgc 420 
acaccctcaa acggagccgc cgggtgaaag ccaacgaccg ggagcggaac cgcatgcacc 480 
acctcaacgc cgcgctggat gagctccgca gcgtcctgcc gaccttcccc gacgacacca 540 
rnactcaccaa aatcgaaacc ctgcgcttcg cttacaacta catctgggcc ctctccgaga 600 
cccttcgttt ggccgagcag tgcctccctc ctccccccgc cttccgcggg ccccccgcgc 660 
cccccagccc cggcagcgac gccggttcgt ggctgtccag cggttccccg gccgccccct 720 
cgctctgcgc ctccgcctcc ggccccagca gcccggccac ctccgaggac tgcggctacg 780 
tcccctcgga cgccctgcgg gccttccgcg ggctgccccc cgccgccccg ggcgctccct 840 
gccgctagcc ctgcccgtgc gtgtctccgt ccccccccac cttclccgta tcccgttgca 900 
cttttcagcc cctcccgccc cccagcccct ctctccgggg tgccctttcc cttcgccccc 960 
ccgcctcgtt ttccatacga cttggaaaac ccggcaaaga aaagcgacag atttgctgcc 1020 
gcagacgagg tgaaaagtca attttacaat ttgtagctct ccggtgaaga aaaa 1 074 



<210> 19 
<211>178 
<212> PRT 
<213> chicken 

<400>19 

Met Pro Ala Glu Ala Ala Ser Ser Gly Gly Val Ser Glu Pro Pro Gly 
1 5 10 15 

Ala Pro Arg Glu Arg Arg Arg Arg Arg Gly Arg Ala Arg Ala Arg Thr 
20 25 30 

Glu Ala Leu Leu His Thr Leu Lys Arg Ser Arg Arg Val Lys Ala Asn 
35 40 45 

Asp Arg Glu Arg Asn Arg Met His His Leu Asn Ala Ala Leu Asp Glu 
50 55 60 

Leu Arg Ser Val Leu Pro Thr Phe Pro Asp Asp Thr Lys Leu Thr Lys 
65 70 75 80 

lie Glu Thr Leu Arg Phe Ala Tyr Asn Tyr He Trp Ala Leu Ser Glu 
85 90 95 

Thr Leu Arg Leu Ala Glu Gin Cys Leu Pro Pro Pro Pro Ala Phe Arg 
100 105 110 

Gly Pro Pro Ala Pro Pro Ser Pro Gly Ser Asp Ala Gly Ser Trp Leu 
115 120 125 

Ser Ser Gly Ser Pro Ala Ala Pro Ser Leu Cys Ala Ser Ala Ser Gly 
130 135 140 

Pro Ser Ser Pro Ala Thr Ser Glu Asp Cys Gly Tyr Val Pro Ser Asp 
145 150 155 160 
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Ala Leu Arg Ala Phe Arg Gly Leu Pro Pro Ala Ala Pro Gly Ala Pro 
165 170 175 



Cys Arg 



<210> 20 
<211>790 
<212> DNA 
<213> chicken 

<400> 20 

gcagccgccg ggcccccggg ccgctccgac gccgcccggg cgcgtgccag cgccccccga 60 
ccgcccgacg gccgtcgacc agcggccgcg cccccctctc ccccgccccc gcaggatgcc 120 
ggtgaaggcg gagagcccgg cgcccgcggc ggaggacgaa ctgctgctgc tgcgcctcgc 180 
ctcgcccgcc ccctcggcct cgctgccgtc cagcgccggc gaggaggacg aggacgagga 240 
ggacgggcgg ccgcggcggc tgcaggaggg cgctcggcgg gcggggcggc agcgagggcc 300 
cccgcgggcg gcgcgcacgg cggagacggc gcagcgcatc aagcggagcc ggcggctgaa 360 
agccaacaac cgcgagcgca accgcatgca caacctgaac gcggcgctgg acgcgctgcg 420 
rtgacgtgctg cccaccttcc ccgaggacgc caagctcacc aagatcgaga cgctgcgctt 480 
cgcccacaac tacatctggg cgctcaccga gacgctgcgc ctggccgggg ccgcccgcct 540 
ggggOQcgcc gccgacgccg cgcccggggc ggccgccgag ggcagcccct cgcccgcctc 600 
gtcgtggagc ggcggcgcca gccccgcgcc ctccgcctcg ccctacgcct gcactttatc 660 
gcccggcagc cccgccggct ccgcctcgga cgccgagcac tggccgcccc cgcggggccg 720 
cttcgccccg ccgccgccgc cccaccgctg cctctaacgc ggcccgggcg gccctcgctc 780 
ctccgacgtg 790 



<210>21 
<211>213 
<212> PRT 
<213> chicken 

<400>21 

Met Pro Val Lys Ala Glu Ser Pro Ala Pro Ala Ala Glu Asp Glu Leu 
1 5 10 15 

Leu Leu Leu Arg Leu Ala Ser Pro Ala Pro Ser Ala Ser Leu Pro Ser 
20 25 30 

Ser Ala Gly Glu Glu Asp Glu Asp Glu Glu Asp Gly Arg Pro Arg Arg 
35 40 45 

Leu Gin Glu Gly Ala Arg Arg Ala Gly Arg Gin Arg Gly Pro Pro Arg 
50 55 60 

Ala Ala Arg Thr Ala Glu Thr Ala Gin Arg lie Lys Arg Ser Arg Arg 
65 70 75 80 

Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu Asn Ala 
85 90 95 

Ala Leu Asp Ala Leu Arg Asp Val Leu Pro Thr Phe Pro Glu Asp Ala 
100 105 110 

Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala His Asn Tyr lie Trp 
115 120 125 
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Ala Leu Thr Glu Thr Leu Arg Leu Ala Gly Ala Ala Arg Leu Gly Gly 
130 135 140 

Ala Ala Asp Ala Ala Pro Gly Ala Ala Ala Glu Gly Ser Pro Ser Pro 
145 150 155 160 

Ala Ser Set Trp Ser Gly Gly Ala Ser Pro Ala Pro Ser Ala Ser Pro 
165 170 175 

Tyr Ala Cys Thr Leu Ser Pro Gly Ser Pro Ala Gly Ser Ala Ser Asp 
180 185 190 

Ala Glu His Trp Pro Pro Pro Arg Gly Arg Phe Ala Pro Pro Pro Pro 
195 200 205 

Pro His Arg Cys Leu 
210 



<210> 22 
<211>450 
<212> DNA 
<213> Mus musculus 

<400> 22 

atgaagtcgg cctgcaaacc ccacggccct ccggcgggag ctcgcggcgc gcccccgtgc 60 
gcgggcgcag ccgagcgcgc ggtctcgtgc gcggggcccg ggcggctgga gagcgcggcg 120 
cgcaggcgtc tggcggccaa cgcgcgcgag cggcgccgca tgcaggggct gaacacggcg 180 
licgaccggc tgcgcagggt ggtgccgcag tggggccagg acaagaagct gtccaagtac 240 
gagacactgc agatggcgct cagctacato atcgcgctca cccgcatcct agccgaagcc 300 
gagcgggact gggtcgggct gcgctgcgag cagcggggcc gcgatcaccc ctacctccct 360 
ttcccgggtg ctaggctcca ggtagaccct gagccctatg ggcagaggct cttcggcttc 420 
cagccggagc ccttccccat ggccagctaa 450 



<210>23 
<211> 149 
<212> PRT 
<213> Mus musculus 

<400> 23 

Met Lys Ser Ala Cys Lys Pro His Gly Pro Pro Ala Gly Ala Arg Gly 
15 10 15 

Ala Pro Pro Cys Ala Gly Ala Ala Glu Arg Ala Val Ser Cys Ala Gly 
20 25 30 

Pro Gly Arg Leu Glu Ser Ala Ala Arg Arg Arg Leu Ala Ala Asn Ala 
35 40 45 

Arg Glu Arg Arg Arg Met Gin Gly Leu Asn Thr Ala Phe Asp Arg Leu 
50 55 60 

Arg Arg Val Val Pro Gin Trp Gly Gin Asp Lys Lys Leu Ser Lys Tyr 
65 70 75 80 

Glu Thr Leu Gin Met Ala Leu Ser Tyr lie lie Ala Leu Thr Arg He 
85 90 95 

Leu Ala Glu Ala Glu Arg Asp Trp Val Gly Leu Arg Cys Glu Gin Arg 



15 



wo 03/047532 



PCT/US02/41458 



100 105 110 

Gly Arg Asp His Pro Tyr Leu Pro Phe Pro Gly Ala Arg Leu Gin Val 
115 120 125 

Asp Pro Glu Pro Tyr Gly Gin Arg Leu Phe Gly Phe Gin Pro Glu Pro 
130 135 140 

Phe Pro Met Ala Ser 
145 



<210> 24 
<211>861 
<212> DNA 
<213> Mus musculus 

<400> 24 

attcttttga gtcgggagaa ctaggtaaca attcggaaac tccaaagggt ggatgagggg 60 
cgcgcggggt gtgtgtgggg gatactctgg tcccccgtgc agtgacctct aagtcagagg 120 
ctggcacaca cacaccttcc attttttccc aaccgcagga tggcgcctca tcccttggat 180 
gcgctcacca tccaagtgtc cccagagaca caacaacctt ttcccggagc ctcggaccac 240 
gaagtgctca gttccaattc caccccacct agccccactc tcatacctag ggactgctcc 300 
gaagcagaag tgggtgactg ccgagggacc tcgaggaagc tccgcgcccg acgcggaggg 360 
cgcaacaggc ccaagagcga gttggcactc agcaaacagc gaagaagccg gcgcaagaag 420 
gccaatgatc gggagcgcaa tcgcatgcac aacctcaact cggcgctgga tgcgctgcgc 480 
ggtgtcctgc ccaccttccc ggatgacgcc aaacttacaa agatcgagac cctgcgcttc 540 
gcccacaact acatctgggc actgactcag acgctgcgca tagcggacca cagcttctat 600 
ggcccggagc cccctgtgcc ctgtggagag ctggggagcc ccggaggtgg ctccaacggg 660 
gactggggct ctatctactc cccagtctcc caagcgggta acctgagccc cacggcctca 720 
ttggaggaat tccctggcct gcaggtgccc agctccccat cctatctgct cccgggagca 780 
ctggtgttct cagacttctt gtgaagagac ctgtctggct ctgggtggtg ggtgctagtg 840 
gaaagggagg ggaccacagc c 861 



<210>25 
<211>214 
<212> PRT 
<213>Mus musculus 

<400>25 

Met Ala Pro His Pro Leu Asp Ala Leu Thr lie Gin Val Ser Pro Glu 
15 10 15 

Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 
20 25 30 

Asn Ser Thr Pro Pro Ser Pro Thr Leu lie Pro Arg Asp Cys Ser Glu 
35 40 45 

Ala Glu Val Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 
50 55 60 

Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 
65 70 75 80 

Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 
85 90 95 

His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 
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100 105 110 

Phe Pro Asp Asp Ala Lys Leu Thr Lys He Glu Thr Leu Arg Phe Ala 
115 120 125 

His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 
130 135 140 

Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 
145 150 155 160 

Pro Gly Gly Gly Ser Asn Gly Asp Trp Gly Ser lie Tyr Ser Pro Val 
165 170 175 

Ser Gin Ala Gly Asn Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 
180 185 190 

Gly Leu Gin Val Pro Ser Ser Pro Ser Tyr Leu Leu Pro Gly Ala Leu 
195 200 205 

Val Phe Ser Asp Phe Leu 
210 



<210>26 

<211>72 

<212> DNA 

<213> Mus musculus 

<400>26 

atgcacgggc tgaatgcggc gctggacaac ccgagaaagg tggtaccttg ctactctaag 60 
acacagaagc to 72 

<210>27 

<211>24 

<212> PRT 

<213> Mus musculus 

<400> 27 

Met His Gly Leu Asn Ala Ala Leu Asp Asn Pro Arg Lys Val Val Pro 
15 10 15 

Cys Tyr Ser Lys Thr Gin Lys Leu 
20 



<210>28 

<211>66 

<212> DNA 

<213> Mus musculus 

<400> 28 

atgcatggcc tgaatgatgc cttggataat cttagaagag tcatgccatg ttactctaaa 60 
actcaa 66 



<210> 29 
<211>22 
<212>PRT 
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<213> Mus musculus 
<400> 29 

Met His Gly Leu Asn Asp Ala Leu Asp Asn Leu Arg Arg Val Met Pro 
1 5 10 15 

Cys Tyr Ser Lys Thr Gin 
20 



<210> 30 
<211>1385 
<212> DNA 
<213> Mus musculus 

<400> 30 

cttaggaagc gccaagcccg cggagcggag gacaccgtgc tcggttccgg gtgggggaca 60 
ttcccggaca cacaccggag cagcagctgc gccggaacat tggagccgcg taggtaagtg 1 20 
tgcatgccgc ggctttccat tcgcaggcag tgtccccacg caggctcacg ccgcccacgc 180 
taactccatc gtttagacgc agtgacttct gtgaccggca gaaggtggct cgagcccggg 240 
gcgctcctcc ccagctctgt cctcgccatc ttcgcgaatg cacattgagg gagatggagg 300 
ggggggggcg gggcgcggcg ccagcgacac tttaccctgt ccattctggg aataaatttc 360 
atctgcctct tctttctcag gatgttcgtc aaatctgaga ctctggagtt gaaggaggaa 420 
gaggaggtac tgatgctgct gggctcggct tccccggcct cggcgaccct gaccccgatg 480 
tcctccagcg cggacgagga ggaggacgag gagctgcgcc ggccgggctc cgcgcgtggg 540 
oagcgtggag cggaagccga gcagggggtg cagggcagtc cggcgtcggg tgccgggggt 600 
tgccggccag ggcggctgct gggcctgatg cacgagtgca agcgtcgccc gtcgcgctca 660 
cgggccgtct cccgaggtgc caagacggcg gagacggtgc agcgcatcaa gaagacccgc 720 
aggctcaagg ccaacaaccg cgagcgcaac cgcatgcaca acctaaacgc cgcgctggac 780 
gcgctgcgcg aggtgctgcc caccttcccc gaggatgcca agctcacgaa gatcgagacg 840 
ctgcgcttcg cccacaatta catctgggcg ctcaccgaga ctctgcgcct ggcggaccac 900 
tgcgccggcg ccggtggcct ccagggggcg ctcttcacgg aggcggtgct cctgagcccg 960 
ggagctgcgc tcggcgccag cggggacagc ccttctccac cttcctcctg gagctgcacc 1020 
aacagcccgg cgtcatcctc caactccacg tccccataca gctgcacttt atcgcccgct 1080 
agccccgggt cagacgtgga ctactggcag cccccacctc cggagaagca tcgttatgcg 1 140 
cctcacctgc ccctcgccag ggactgtatc tagagctgcg ggtctccxtc tctcgtctcc 1200 
tacccgggcc ctccttccca tccttctccc gccccccacc ctccacgccc cggaatccac 1260 
ttcacagaac agaagttggc cctttgcaat cccctccgcg gctggtgctt cgggggttgg 1320 
aaaacaactc tggtttattg aaattaagat tttggtcaaa aagaatatgc tttttggaat 1380 
tgggg 1385 



<210>31 
<211>263 
<212> PRT 
<213> Mus musculus 

<400> 31 

Met Phe Val Lys Ser Glu Thr Leu Glu Leu Lys Glu Glu Glu Glu Val 
1 5 10 15 

Leu Met Leu Leu Gly Ser Ala Ser Pro Ala Ser Ala Thr Leu Thr Pro 
20 25 30 

Met Ser Ser Ser Ala Asp Glu Glu Glu Asp Glu Glu Leu Arg Arg Pro 
35 40 45 

Gly Ser Ala Arg Gly Gin Arg Gly Ala Glu Ala Glu Gin Gly Val Gin 
50 55 60 
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Gly Ser Pro Ala Ser Gly Ala Gly Gly Cys Arg Pro Gly Arg Leu Leu 
65 70 75 80 

Gly Leu Met His Glu Cys Lys Arg Arg Pro Ser Arg Ser Arg Ala Val 
85 90 95 

Ser Arg Gly Ala Lys Thr Ala Glu Thr Val Gin Arg lie Lys Lys Thr 
100 105 110 

Arg Arg Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu 
115 120 125 

Asn Ala Ala Leu Asp Ala Leu Arg Glu Val Leu Pro Thr Phe Pro Glu 
130 135 140 

Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala His Asn Tyr 
145 150 155 160 

He Trp Ala Leu Thr Glu Thr Leu Arg Leu Ala Asp His Cys Ala Gly 
165 170 175 

Ala Gly Gly Leu Gin Gly Ala Leu Phe Thr Glu Ala Val Leu Leu Ser 
180 185 190 

Pro Gly Ala Ala Leu Gly Ala Ser Gly Asp Ser Pro Ser Pro Pro Ser 
195 200 205 

Ser Trp Ser Cys Thr Asn Ser Pro Ala Ser Ser Ser Asn Ser Thr Ser 
210 215 220 

Pro Tyr Ser Cys Thr Leu Ser Pro Ala Ser Pro Gly Ser Asp Val Asp 
225 230 235 240 

Tyr Trp Gin Pro Pro Pro Pro Glu Lys His Arg Tyr Ala Pro His Leu 
245 250 255 

Pro Leu Ala Arg Asp Cys lie 
260 



<210> 32 
<211>3541 
<212> DNA 
<213> Mus musculus 

<400> 32 

cagctgggca aggaaaatat gaaacacaag tgatctatat ccacagaata aacactaagt 60 
ttatgtgtct tttgacaagg tctcatgtag ctcagactgg ggttaaactt tttatgctac 120 
taaagataaa cttcaatatg ctaataaaga taaactcctg atttttctgc ttccacctct 180 
taaggcttgg gatcataggc atgtgccatc atgtctttct cagtttagat ttataccagg 240 
tattttgtgg tataacattt ttatattttc ctttttccag aatctataca tggcaaaaat 300 
gtatatgaaa tccaaggaca tggtggagct ggtcaacaca caatcctgga tggacaaagg 360 
tctgagctct caaaatgaga tgaaggagca agagagaaga ccgggctctt atggaatgct 420 
cggaacctta actgaagagc atgacagtat tgaggaggat gaagaagagg aagaagatgg 480 
agataaacct aaaagaagag gtcccaagaa aaagaagatg actaaagctc gccttgaaag 540 • 
attcagggct cgaagagtca aggccaatgc tagagaacgg acccggatgc atggcctgaa 600 
tgatgccttg gataatctta ggagagtcat gccatgttac tctaaaactc aaaagctttc 660 
caagatagag actcttcgac tggcaaggaa ctacatctgg gccttgtctg aagtcctgga 720 
gactggtcag acacttgaag ggaagggatt tgtagagatg ctatgtaaag gtctctctca 780 
acccacaagc aacctggttg ctggatgcct ccaactgggg cctcaatcta ccctcctgga 840 
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gaagcatgag gaaaaatctt caatttgtga ctctactatc tctgtccaca gcttcaacta 900 
tcagtctcca gggctcccca gccctcctta tggccatatg gaaacacatt ctctccatct 960 
caagcctcaa ccatttaaga gtttgggtga ctcttttggg agccatccac ctgactgcag 1020 
taccccccct tatgagggtc cactcacacc acccctgagc attagtggca acttctcctt 1080 
aaagcaagac ggctcccctg atttggaaaa atcctacaat ttcatgccac attatacctc 1 140 
tgcaagtcta agttcagggc atgtgcattc aactcccttt cagactggca ctccccgcta 1200 
tgatgttcct gtagacctga gctatgattc ctactcccac catagcattg gaactcagct 1260 
caatacgatc ttctctgatt agagcaataa gataagcacc aatatttcag agaatgaagt 1320 
ggagattttt ttcacatttc tagtggctga gctaaactct cagaaaattt aaaagaacct 1 380 
ttggatatgc atcaaacata atagtcctag tttgttcaga acttcctgta cctgctaact 1440 
ttcttcccat taacttctca cattggacca gtcctacatt tggtaaactt aagtgaatat 1 500 
atttgatggt ttgaggccac atggtaatag aacagaaaga aagcccaggc cctgttccaa 1560 
tggtgccaaa gattaattga atgctctgcc aattaacttt ccatttccag tgtttttatt 1620 
gctttctgat aaacatgaag caactgttcc aaatcaacat atacctttca cctcccacac 1680 
atttttaaat ttaaaagcaa tcaaagcaaa tagcaaaaat ggaatgatta tacagaactg 1740 
qaaggagcat caagtacatg tctgttggct tatagaatac aaaatttgtg tgaatttgac 1800 
aaatcatctt tgtgtctatt taaatataac ttccagagcc taaaataata acttggatgt 1860 
taccataaat aaccagtatg ttctttaaga gatcagctct acttattact gtgctgaaag 1920 
gtatacacac cttttttagt gattggagaa ccatgataga agcctcacac aaactttatt 1 980 
ctttatacta tttaaaaaac aactgtctta gtactaggag acaagcaaga aagacattga 2040 
aatttctctt ctggcacaca gaaatatcac ctagctcatt tcttagctcc cgtgactata 2100 
gggctgtaga attttgcagg tattcaggtg cttcagttag aatcagaact cagccaggtt 2160 
catactgtag agcaataaaa tggtggttgg ctgctatccc aactaacaca gttaaagaac 2220 
tctgcctcac acatagccac tggaaaaatg tggatattct ccagcaagat gaatttcact 2280 
gtttaaagca atgcaattaa agccatagag tttcgtccac tccagtatca tatattccag 2340 
aactgttgta atcaatcttg aattcttaca acataaatgc aactccttac ttcccaccta 2400 
acactgattg ttatattgtc ttcaattcca agatattatg caattatatg caaaattttg 2460 
attagaatca aaattaagag tcaatgaatc tgtctgtatc ttcaggacgg gttttgatca 2520 
gttttaagaa agtttatttt cctttatgtg gcatctcttt tctttgtaac cacactggtt 2580 
cagccaagtt tctcttctcc agagaaatta gctctgagaa attttactat catgatccat 2640 
cttccacagc aattatttag gttcaactca agagtataca tagtttattt atagtgggtg 2700 
aggatacacc tccaagaata aattttaaca acattaatga catatgaata tgccatttta 2760 
tctaccaaac tatatatgta tctcttttct ttattgccct tatttatttc ttcacactga 2820 
gagttatttt gtgtccatct tattgcagca cttactctgc tctactttgc acctttggat 2880 
tataaatatg tttaaaagtc tgtaaagacg tcttaaacaa ctcgtgacag taattcacca 2940 
cccctaagac cttgaatcac cctagtggaa ataggcaagg agaattattt atagaatcat 3000 
cctatgtaat tttttttgag aatttgctct acctagcatt tatgtttata gataattgct 3060 
atctgcatta tttattaggt tctatttatt taatttatct ttctttcttt ttatgtaaac 3120 
atttgtgccc catagatata gcctcaaagc ttcactggga aactagctta tatgtttgga 3180 
gtgagagaaa aggagaaaat cagttcttga ttgcttgcaa tggttttata aaacagagca 3240 
ataatttgaa tagatatgca acttaatggt tttagaattt ttcctttaag gtgcaacaga 3300 
gttacattat tatttatgac tttggagaat gtagtacatg tgaaccagga ctgtaggctt 3360 
gtgaagagag attttataat taaatacaaa tttagtactg tactatgctt ggaaagaact 3420 
tgttctttaa ataatgttta gtcttctggg agtgttttca gataaaatga agcaattgtt 3480 
taaaagaaat ggctgttttc cctccctttt ccagtagcaa taaagctttg agtgttatta 3540 
c 3541 



<210> 33 
<211>330 
<212> PRT 
<213> Mus musculus 

<400> 33 

Met Ala Lys Met Tyr Met Lys Ser Lys Asp Met Val Glu Leu Val Asn 
1 5 10 15 

Thr Gin Ser Trp Met Asp Lys Gly Leu Ser Ser Gin Asn Glu Met Lys 
20 25 30 
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Glu Gin Glu Arg Arg Pro Gly Ser Tyr Gly Met Leu Gly Thr Leu Thr 
35 40 45 

Glu Glu His Asp Ser lie Glu Glu Asp Glu Glu Glu Glu Glu Asp Gly 
50 55 60 

Asp Lys Pro Lys Arg Arg Gly Pro Lys Lys Lys Lys Met Thr Lys Ala 
65 70 75 80 

Arg Leu Glu Arg Phe Arg Ala Arg Arg Val Lys Ala Asn Ala Arg Glu 
85 90 95 

Arg Thr Arg Met His Gly Leu Asn Asp Ala Leu Asp Asn Leu Arg Arg 
100 105 110 

Val Met Pro Cys Tyr Ser Lys Thr Gin Lys Leu Ser Lys lie Glu Thr 
115 120 125 

Leu Arg Leu Ala Arg Asn Tyr lie Trp Ala Leu Ser Glu Val Leu Glu 
130 135 140 

Thr Gly Gin Thr Leu Glu Gly Lys Gly Phe Val Glu Met Leu Cys Lys 
145 150 155 160 

Gly Leu Ser Gin Pro Thr Ser Asn Leu Val Ala Gly Cys Leu Gin Leu 
165 170 175 

Gly Pro Gin Ser Thr Leu Leu Glu Lys His Glu Glu Lys Ser Ser lie 
180 185 190 

Cys Asp Ser Thr lie Ser Val His Ser Phe Asn Tyr Gin Ser Pro Gly 
195 200 205 

Leu Pro Ser Pro Pro Tyr Gly His Met Glu Thr His Ser Leu His Leu 
210 215 220 

Lys Pro Gin Pro Phe Lys Ser Leu Gly Asp Ser Phe Gly Ser His Pro 
225 230 235 240 

Pro Asp Cys Ser Thr Pro Pro Tyr Glu Gly Pro Leu Thr Pro Pro Leu 
245 250 255 

Ser lie Ser Gly Asn Phe Ser Leu Lys Gin Asp Gly Ser Pro Asp Leu 
260 265 270 

Glu Lys Ser Tyr Asn Phe Met Pro His Tyr Thr Ser Ala Ser Leu Ser 
275 280 285 

Ser Gly His Val His Ser Thr Pro Phe Gin Thr Gly Thr Pro Arg Tyr 
290 295 300 

Asp Val Pro Val Asp Leu Ser Tyr Asp Ser Tyr Ser His His Ser He 
305 310 315 320 

Gly Thr Gin Leu Asn Thr lie Phe Ser Asp 
325 330 



<210>34 
<211>800 
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<212> DNA 

<213> Mus musculus 

<400> 34 

agggtagtag agtcactaat actacgtcct tgctgtgtag ttttctcagg ttttgattca 60 
ttttaagcat gggccaccaa tctcatgtac ctttttagct ttctttccaa gtgtgtctcc 120 
agttttctga cccccctcct gttttgcctg ctctaccctg ccttgtacct ttctaatcac 180 
aagtcttttc agttccctta gttttcaccc atcaacttca gcagcccaca ccctctagtt 240 
ccttcctggt ttaaacaaaa acaaacacgc agtggcaaag ctggacctgg tcagagaagc 300 
cttgtgaagg aggtgtgtct ttaggctagg aaggggaggg gctaccctgt gggcaacatc 360 
tcccgccctg gtcagcagcc aaaaccagca aaacggcggc aagtcagaag ctccagtcag 420 
atcacaggag ctgcccagag actgtggtac tgaaagaact actcgcggga gctgaccccg 480 
ggaaagaggt actgaaaaga catagaaaac cagctgtggt ggaggcactg acatgaaggc 540 
atcctggtag tgcatcagaa ctccaggaaa aagtagaata agtaacagcc aaggtactca 600 
gaccaggaac agtcactaga aggtagctac cagtttaaca tggacgactg aaagggtctt 660 
ctgtttccca cgatctgcct ggtcaggtca gggtagaact gactgctctg atagttcttc 720 
aggacacaga ttagagttta atcttggaac tggacttcca gaggtgagcc tgtgaacggg 780 
gtgtgggtac taaagtttct 800 



<210> 35 
<211>515 
<212> DNA 
<213> chicken 

<400> 35 

tttttctgta tgcgtgtgaa gctgcatctg caatagatat gaaaacctgt caatccagtc 60 
atttggattc aggagtagaa tcagacatcc agtgcagaag tggatcaggc tgtgttgtga 120 
agtgcagcac agaaagaatg gagagcgctg ccaagagaag actggctgcc aacgccaggg 180 
3gagaagacg gatgcaagga ctgaacacag cctttgatcg tttgaggaag gtggttccac 240 
agtggggtca agataagaag ctctccaaat atgagaccct tcagatggct ttgagttata 300 
tcatggctct aacacgaata cttgctgaag cagagagata cagtactgaa agagaatgga 360 
ttaaccttca ctgtgaacac tttcatccag agagctacca ccattatacg ggacaaaaag 420 
tggcaacaga cagtgatcct tatgcacagc gaatattcag ctatcaccct gaacactttc 480 
aaatagctaa ttagaactta ttacgagcta aaaaa 51 5 



<210> 36 
<211>151 
<212> PRT 
<213> chicken 

<400> 36 

IVlet Lys Thr Cys Gin Ser Ser His Leu Asp Ser Gly Val Glu Ser Asp 
1 5 10 15 

lie Gin Cys Arg Ser Gly Ser Gly Cys Val Val Lys Cys Ser Thr Glu 
20 25 30 

Arg Met Glu Ser Ala Ala Lys Arg Arg Leu Ala Ala Asn Ala Arg Glu 
35 40 45 

Arg Arg Arg Met Gin Gly Leu Asn Thr Ala Phe Asp Arg Leu Arg Lys 
50 55 60 

Val Val Pro Gin Trp Gly Gin Asp Lys Lys Leu Ser Lys Tyr Glu Thr 
65 70 75 80 

Leu Gin Met Ala Leu Ser Tyr lie Met Ala Leu Thr Arg lie Leu Ala 
85 90 95 
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Glu Ala Glu Arg Tyr Ser Thr Glu Arg Glu Tip He Asn Leu His Cys 
100 105 110 

Glu His Phe His Pro Glu Ser Tyr His His Tyr Thr Gly Gin Lys Val 
115 120 125 

Ala Thr Asp Ser Asp Pro Tyr Ala Gin Arg lie Phe Ser Tyr His Pro 
130 135 140 

Glu His Phe Gin lie Ala Asn 
145 150 



<210> 37 

<211> 1412 

<212>DNA 

<213> Mus musculus 

<400> 37 

cgtgctcggt tccgggctgc ggggacattc ccggacacac accggagcag cagctgcgcc 60 
gcgacacatc tggagccgcg taggatgttc gtcaaatctg agactctgga gttgaaggag 120 
gaagaggagg tactgatgct gctgggctcg gcttccccgg cctcggcgac cctgaccccg 180 
atgtcctcca gcgcggacga ggaggaggac gaggagctgc gccggccggg ctccgcgcgt 240 
gggcagcgtg gagcggaagc cgggcagggg gtgcagggca gtccggcgtc gggtgccggg 300 
ggttgccggc cagggcggct gctgggcctg atgcacgagt gcaagcgtcg cccgtcgcgc 360 
tcacgggccg tctcccgagg tgccaagacg gcggagacgg tgcagcgcat caagaagacc 420 
cgcaggctca aggccaacaa ccgcgagcgc aaccgcatgc acaacctaaa cgccgcgctg 480 
gacgcgctgc gcgaggtgct gcccaccttc cccgaggatg ccaagctcac gaagatcgag 540 
acgctgcgct tcgcccacaa ttacatctgg gcgctcaccg agactctgcg cctggcggac 600 
cactgcgccg gcgccggtgg cctccagggg gcgctcttca cggaggcggt gctcctgagc 660 
ccgggagctg cgctcggcgc cagcggggac agcccttctc caccttcctc ctggagctgc 720 
accaacagcc cggcgtcatc ctccaactcc acgtccccat acagctgcac tttatcgccc 780 
gctagccccg ggtcagacgt ggactactgg cagcccccac ctccggagaa gcatcgttat 840 
gcgcctcacc tgcccctcgc cagggactgt atctagagct gcgggtctcc ctctctcgtc 900 
ctctacccgg ccctcttccc atccttctcc cgcccctcac cx:tccacgcc ccggactcca 960 
ottcacagag cagaggtggc ccttgcaatc ccctcggcgg ctggtgcatt cgggggtgga 1020 
gaccagctct ggtttattga agatgtgagg atttatggtc aaagaggact atggcgtgtg 1080 
ggagtggggg ctggcgtggg gaacxtcgta agactgtaaa agacactgag aaaaagtacc 1140 
ataactaacg agtgtgcaga gcagactgac gctcctcccc tctctcagag ctgctggagg 1200 
agaactccgg gcaggcagtt cgtgtgaatc tctcagaggg aatgcaactg gtccctgtga 1260 
tcttttcacc ttcgtttcta catagagatg ttaatgtcag tcgaaagaaa tgtattttag 1320 
catctgaatg aatttactgg taataatatt atccacacat ttgcaatggc tggcatctgc 1380 
tctattccca ttgctgtctg caggctgtgg ga 1412 



<210> 38 
<211>263 
<212> PRT 
<213> Mus musculus 

<400> 38 

Met Phe Val Lys Ser Glu Thr Leu Glu Leu Lys Glu Glu Glu Glu Val 
1 5 10 15 

Leu Met Leu Leu Gly Ser Ala Ser Pro Ala Ser Ala Thr Leu Thr Pro 
20 25 30 

Met Ser Ser Ser Ala Asp Glu Glu Glu Asp Glu Glu Leu Arg Arg Pro 
35 40 45 
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Gly Ser Ala Arg Gly Gin Arg Gly Ala Glu Ala Gly Gin Gly Val Gin 
50 55 60 

Gly Ser Pro Ala Ser Gly Ala Gly Gly Cys Arg Pro Gly Arg Leu Leu 
65 70 75 80 

Gly Leu Met His Glu Cys Lys Arg Arg Pro Ser Arg Ser Arg Ala Val 
85 90 95 

Ser Arg Gly Ala Lys Thr Ala Glu Thr Val Gin Arg lie Lys Lys Thr 
100 105 110 

Arg Arg Leu Lys Ala Asn Asn Arg Glu Arg Asn Arg Met His Asn Leu 
115 120 125 

Asn Ala Ala Leu Asp Ala Leu Arg Glu Val Leu Pro Thr Phe Pro Glu 
130 135 140 

Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala His Asn Tyr 
145 150 155 160 

lie Trp Ala Leu Thr Glu Thr Leu Arg Leu Ala Asp His Cys Ala Gly 
165 170 175 

Ala Gly Gly Leu Gin Gly Ala Leu Phe Thr Glu Ala Val Leu Leu Ser 
180 185 190 

Pro G!y Ala Ala Leu Gly Ala Ser Gly Asp Ser Pro Ser Pro Pro Ser 
195 200 205 

Ser Trp Ser Cys Thr Asn Ser Pro Ala Ser Ser Ser Asn Ser Thr Ser 
210 215 220 

Pro Tyr Ser Cys Thr Leu Ser Pro Ala Ser Pro Gly Ser Asp Val Asp 
225 230 235 240 

Tyr Trp Gin Pro Pro Pro Pro Glu Lys His Arg Tyr Ala Pro His Leu 
245 250 255 

Pro Leu Ala Arg Asp Cys lie 
260 



<210> 39 
<211>938 
<212> DNA 
<213> Zebra Fish 

<400> 39 

cttatagggc tcgagcggcc gcccgggcag gtcaggtgta agaagagaaa tcttcccagg 60 
caaaatatcc gtccctgtat ccatagccac aaactttcct cccaaaagca caaaccaaca 120 
gaatggatgg aatgagcacg gatacaagag aggtggttga actcgacgtc cagcattcga 180 
gcttggggcg gggggagcag agcaagtacc caccagcctt ggcactcatg gccagcagtg 240 
acccacgcgc ctggctggct cccgtgcagg ctggcacctg cgcggcacac gccgaatacc 300 
tgctgcactc gcccggctcg agcgcggaag gcgtgtcctc tgcctccaac ttcaggaaga 360 
gcagcaagag tcctgtcaaa gtacgcgagc tctgccggct taaaggagct gtgggggcag 420 
atgagggcag acagcgggcc ccatccagca aatccaccaa cgtcgtgcag aaacagaggc 480 
gaatggctgc caatgcccgg gagaggcgaa gaatgcacgg attgaaccac gcgttcgacg 540 
agctgcgcag tgtcatccca gcctttgaca acgacaagaa actctccaag tacgaaaccc 600 
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tgcagatggc ccagatctac atcaacgccc tgtccgactt actacagggc cccggtgcta 660 
aagccgaccc gccaaactgc gacctgctgc atgccaacgt gttagaaacg gaccgatctc 720 
ccagaggatc accgggcgtc tgtcggagag gcacgggcgt gggttacccg taccagtacg 780 
dggacggaac attcaactct ttcatggagc aagacctcca gtcgccctct ggaacgagca 840 
agtctggttc ggaggccagt aaagactcgc ctcggtcgaa ccggagtgat ggagaagttc 900 
tcgcctcact gaagtgcgag tgagacctgc ccgggcgg 938 



<210> 40 
<211>266 
<212> PRT 
<213> Zebra Fish 

<400> 40 

Met Asp Gly Met Ser Thr Asp Thr Arg Glu Val Val Glu Leu Asp Val 
15 10 15 

Gin His Ser Ser Leu Gly Arg Gly Glu Gin Ser Lys Tyr Pro Pro Ala 
20 25 30 

Leu Ala Leu Met Ala Ser Ser Asp Pro Arg Ala Trp Leu Ala Pro Val 
35 40 45 

Gin Ala Gly Thr Cys Ala Ala His Ala Glu Tyr Leu Leu His Ser Pro 
50 55 60 

Gly Ser Ser Ala Glu Gly Val Ser Ser Ala Ser Asn Phe Arg Lys Ser 
65 70 75 80 

Ser Lys Ser Pro Val Lys Val Arg Glu Leu Cys Arg Leu Lys Gly Ala 
85 90 95 

Val Gly Ala Asp Glu Gly Arg Gin Arg Ala Pro Ser Ser Lys Ser Thr 
100 105 110 

Asn Val Val Gin Lys Gin Arg Arg Met Ala Ala Asn Ala Arg Glu Arg 
115 120 125 

Arg Arg Met His Gly Leu Asn His Ala Phe Asp Glu Leu Arg Ser Val 
130 135 140 

lie Pro Ala Phe Asp Asn Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu 
145 150 155 160 

Gin Met Ala Gin lie Tyr lie Asn Ala Leu Ser Asp Leu Leu Gin Gly 
165 170 175 

Pro Gly Ala Lys Ala Asp Pro Pro Asn Cys Asp Leu Leu His Ala Asn 
180 . 185 190 

Val Leu Glu Thr Asp Arg Ser Pro Arg Gly Ser Pro Gly Val Cys Arg 
195 200 205 

Arg Gly Thr Gly Val Gly Tyr Pro Tyr Gin Tyr Glu Asp Gly Thr Phe 
210 215 220 

Asn Ser Phe Met Glu Gin Asp Leu Gin Ser Pro Ser Gly Thr Ser Lys 
225 230 235 240 

Ser Gly Ser Glu Ala Ser Lys Asp Ser Pro Arg Ser Asn Arg Ser Asp 
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245 250 255 

Gly Glu Val Leu Ala Ser Leu Lys Cys Glu 
260 265 



<210>41 
<211>948 
<212> DNA 
<213> Frog 

<400> 41 

atgtcagaga tggtcaatgt gcatgggtgg atggaggaag cccttagttc ccaggatgag 60 
atgaaggaga ggaatcagtc tgcctatgat atcatttcag gtctatgcca tgaggaaagg 1 20 
ggcagcattg atggagaaga ggatgatgaa gaagaagagg atggagagaa accaaaaaag 180 
aggggaccca aaaaaaagaa gatgaccaag gctagagtgg agaggttccg tgtccgtaga 240 
gtaaaagcca atgccaggga gcgttcaaga atgcatggac ttaatgatgc cctggaaaat 300 
ttgagaaggg ttatgccttg ctattccaaa acacaaaagt tgtctaaaat tgagactctt 360 
agactggcca gaaactatat atgggcatta tctgatattc tagaacaagg tcaaaatgca 420 
gagggaaagg gctttctgga aatactctgc aaaggtcttt ctcagccaac aagcaactta 480 
gtagctggct gcttgcaact tggacctcag gccatgttct tggataaaca cgaagaaaag 540 
tctcatatat gtgattcctc tcttactggt catacttata attaccagtc cccaggacta 600 
cccagtcctc cttatggtaa cattgatgtt caccacttgc acttgaaacc ctcttctttc 660 
aaaccagtaa tggatccttc tgtggtaacc catacactta actgtaccac tccaccatat 720 
gaaggagctc taacacctcc actcagcatc ggtggtaatt tttctttgaa gcaagatagt 780 
tcacccgata tggataaatc atatgcattc aggtccccct atccagctct tgggcttggt 840 
ggatctcatg gacatgcgtc acactttcat accagtgttc caaggtatga actacccata 900 
gacatggctt acgagcctta cccacaccat gctatattca ctgaataa 948 



<210> 42 
<211>315 
<212>PRT 
<213> Frog 

<400>42 

Met Ser Glu Met Val Asn Val His Gly Trp Met Glu Glu Ala Leu Ser 
1 5 10 15 

Ser Gin Asp Glu Met Lys Glu Arg Asn Gin Ser Ala Tyr Asp lie lie 
20 25 30 

Ser Gly Leu Cys His Glu Glu Arg Gly Ser lie Asp Gly Glu Glu Asp 
35 40 45 

Asp Glu Glu Glu Glu Asp Gly Glu Lys Pro Lys Lys Arg Gly Pro Lys 
50 55 60 

Lys Lys Lys Met Thr Lys Ala Arg Val Glu Arg Phe Arg Val Arg Arg 
65 70 75 80 

Val Lys Ala Asn Ala Arg Glu Arg Ser Arg Met His Gly Leu Asn Asp 
85 90 95 

Ala Leu Glu Asn Leu Arg Arg Val Met Pro Cys Tyr Ser Lys Thr Gin 
100 105 110 

Lys Leu Ser Lys He Glu Thr Leu Arg Leu Ala Arg Asn Tyr lie Trp 
115 120 125 
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Ala Leu Ser Asp lie Leu Glu Gin Gly Gin Asn Ala Gtu Gly Lys Gly 
130 135 140 

Phe Leu Glu lie Leu Cys Lys Gly Leu Ser Gin Pro Thr Ser Asn Leu 
145 150 155 160 

Val Ala Gly Cys Leu Gin Leu Gly Pro Gin Ala Met Phe Leu Asp Lys 
165 170 175 

His Glu Glu Lys Ser His lie Cys Asp Ser Ser Leu Thr Gly His Thr 
180 185 190 

Tyr Asn Tyr Gin Ser Pro Gly Leu Pro Ser Pro Pro Tyr Gly Asn He 
195 200 205 

Asp Val His His Leu His Leu Lys Pro Ser Ser Phe Lys Pro Val Met 
210 215 220 

Asp Pro Ser Val Val Thr His Thr Leu Asn Cys Thr Thr Pro Pro Tyr 
225 230 235 240 

Glu Gly Ala Leu Thr Pro Pro Leu Ser lie Gly Gly Asn Phe Ser Leu 
245 250 255 

Lys Gin Asp Ser Ser Pro Asp l\^et Asp Lys Ser Tyr Ala Phe Arg Ser 
260 265 270 

Pro Tyr Pro Ala Leu Gly Leu Gly Gly Ser His Gly His Ala Ser His 
275 280 285 

Phe His Thr Ser Val Pro Arg Tyr Glu Leu Pro He Asp Met Ala Tyr 
290 295 300 

Glu Pro Tyr Pro His His Ala He Phe Thr Glu 
305 310 315 



<210> 43 
<211>1550 
<212>DNA 
:213>Musmusculus 

<400> 43 

tttaaatact gtgctggcat tttttttcag gtaaattaag gttaaagaac catgttaaca 60 
ctaccgtttg acgagtctgt cgtaatgccc gaatcccaga tgtgcagaaa gtttgctaga 120 
caatgtgagg accagaaaca aattaagaaa ccagagagct ttccaaaaca agttgtcctt 180 
cgaggaaaga gcattaaaag ggcccctgga gaagaaaccg agaaagaaga ggaggaagaa 240 
gacagagagg aagaagatga gaatggcttg tccagaagga gggggctcag gaaaaaaaag 300 
accaccaaac tacgactgga aagggtcaag ttcaggagac aggaagctaa tgcgcgcgag 360 
aggaaccgga tgcacggcct caatgatgct ctggacaatt tgcgaaaagt ggtcccctgt 420 
tactctaaaa cccaaaaact gtccaaaata gaaactttac gactggccaa aaattacatc 480 
tgggcacttl ctgaaattct gaggattggc aagagaccgg atctgctcac gttcgtccaa 540 
aacttatgca aaggtctttc ccagccaact acaaacttgg tggcaggctg cttacagctc 600 
aacgccagaa gtttcctgat gggtcagggt ggggaggctg cccaccacac aaggtcaccc 660 
tactccacat tctacccacc ctaccacagc cctgagctgg ccactccccc agggcatggg 720 
actcttgata attccaagtc catgaaaccc tacaattact gcagtgcata tgaatccttc 780 
tatgaaagta cctcccctga gtgtgccagc cctcagtttg aaggtccctt aagtcctccc 840 
ccaattaact ataatgggat attttccctg aagcaagaag aaaccttgga ctatggcaaa 900 
aattacaatt atggcatgca ttactgtgca gtgccaccca ggggtcccct tgggcagggt 960 
gccatgttca ggttgcccac cgacagccac ttoccttacg acttacatct gcgcagccaa 1020 
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vctctcacta tgcaagatga attaaatgca gtttttcata attaatgagg aaaattaaaa 1080 
taaacagtgg tcattcacct cccactctaa ttaaggcaaa gcagatgctt gtgggctgag 1 140 
taattggcac aactctatct aaggtgttta ctagtttctg aagtgtgttt caaagattgt 1200 
gaccattttc tatgtcataa taaatccctt ttcgtatgag aacttccttt ccttccctct 1260 
tgtctgtatc acactgtgat tctctctctc tctctctctc tctctctctc tctctctctc 1320 
tcttactggc agaatatttc tttcttgttt tagtttcttt caaattcact taatttgttt 1380 
gaacaaggtg tctaagatgt tgctgaataa agacatgcac acagcatact tcaatgtcta 1440 
tttcagttgt acagttatga tgaaaatgca tgttataaaa atcagatgag taaaatgtgt 1500 
ttataattac taggattcat atatgtatct ctgaaatttt agtttttaaa 1 550 



<210>44 
<211>337 
<212> PRT 
<213> Mus musculus 

<400> 44 

Met Leu Thr Leu Pro Phe Asp Glu Ser Val Val Met Pro Glu Ser Gin 
1 5 10 15 

Met Cys Arg Lys Phe Ala Arg Gin Cys Glu Asp Gin Lys Gin lie Lys 
20 25 30 

Lys Pro Glu Ser Phe Pro Lys Gin Val Val Leu Arg Gly Lys Ser lie 
35 40 45 

Lys Arg Ala Pro Gly Glu Glu Thr Glu Lys Glu Glu Glu Glu Glu Asp 
50 55 60 

Arg Glu Glu Glu Asp Glu Asn Gly Leu Ser Arg Arg Arg Gly Leu Arg 
65 70 75 80 

Lys Lys Lys Thr Thr Lys Leu Arg Leu Glu Arg Val Lys Phe Arg Arg 
85 90 95 

Gin Glu Ala Asn Ala Arg Glu Arg Asn Arg Met His Gly Leu Asn Asp 
100 105 110 

Ala Leu Asp Asn Leu Arg Lys Val Val Pro Cys Tyr Ser Lys Thr Gin 
115 120 125 

Lys Leu Ser Lys lie Glu Thr Leu Arg Leu Ala Lys Asn Tyr He Trp 
130 135 140 

Ala Leu Ser Glu lie Leu Arg lie Gly Lys Arg Pro Asp Leu Leu Thr 
145 150 155 160 

Phe Val Gin Asn Leu Cys Lys Gly Leu Ser Gin Pro Thr Thr Asn Leu 
165 170 175 

Val Ala Gly Cys Leu Gin Leu Asn Ala Arg Ser Phe Leu Met Gly Gin 
180 185 190 

Gly Gly Glu Ala Ala His His Thr Arg Ser Pro Tyr Ser Thr Phe Tyr 
195 200 205 

Pro Pro Tyr His Ser Pro Glu Leu Ala Thr Pro Pro Gly His Gly Thr 
210 215 220 

Leu Asp Asn Ser Lys Ser Met Lys Pro Tyr Asn Tyr Cys Ser Ala Tyr 
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225 230 235 240 

Glu Ser Phe Tyr Glu Ser Thr Ser Pro Glu Cys Ala Ser Pro Gin Phe 
245 250 255 

Glu Gly Pro Leu Ser Pro Pro Pro lie Asn Tyr Asn Gly He Phe Ser 
260 265 270 

Leu Lys Gin Glu Glu Thr Leu Asp Tyr Gly Lys Asn Tyr Asn Tyr Gly 
275 280 285 

Met His Tyr Cys Ala Val Pro Pro Arg Gly Pro Leu Gly Gin Gly Ala 
290 295 300 

Met Phe Arg Leu Pro Thr Asp Ser His Phe Pro Tyr Asp Leu His Leu 
305 310 315 320 

Arg Ser Gin Ser Leu Thr Met Gin Asp Glu Leu Asn Ala Val Phe His 
325 330 335 

Asn 



<210> 45 
<211>1393 
<212> DNA 
<213> Mus musculus 

<400> 45 

aagcttcgtt gcacgcgacc tggtgtgcga tctccgagtg agagggggag ggtcagagga 60 
ggaaggaaaa aaaatcagac cttgcagaag agactaggaa ggtttttgtt gttgttgttc 120 
ggggcttatc cccttcgttg aactgggttg ccagcacctc ctctaacacg gcacctccga 1 80 
gccattgcag tgcgatgtcc cgcctgctgc atgcagaaga gtgggctgag gtaaaagagt 240 
tgggggacca ccatcgccat ccccagccgc accacgtccc gccgctgacg ccacagccac 300 
ctgctaccct gcaggcgaga gaccttcccg tctacccggc agaactgtcc ctcctggata 360 
gcaccgaccc acgcgcctgg ctgactccca ctttgcaggg cctctgcacg gcacgcgccg 420 
cccagtatct gctgcattct cccgagctgg gtgcctccga ggccgcggcg ccccgggacg 480 
aggctgacag ccagggtgag ctggtaagga gaagcggctg tggcggcctc agcaagagcc 540 
ccgggcccgt caaagtacgg gaacagctgt gcaagctgaa gggtggggtt gtagtggacg 600 
agcttggctg cagccgccag cgagcccctt ccagcaaaca ggtgaatggg gtacagaagc 660 
aaaggaggct ggcagcaaac gcaagggaac ggcgcaggat gcacgggctg aaccacgcct 720 
tcgaccagct gcgcaacgtt atcccgtcct tcaacaacga caagaagctg tccaaatatg 780 
agaccctaca gatggcccag atctacatca acgctctgtc ggagttgctg cagactccca 840 
atgtcggaga gcaaccgccg ccgcccacag cttcctgcaa aaatgaccac catcaccttc 900 
gcaccgcctc ctcctatgaa ggaggtgcgg gcgcctctgc ggtagctggg gctcagccag 960 
ccccgggagg gggcccgaga cctaccccgc ccgggccttg ccggactcgc ttctcaggcc 1020 
cagcttcctc tgggggttac tcggtgcagc tggacgcttt gcacttccca gccttcgagg 1080 
acagggccct aacagcgatg atggcacaga aggacctgtc gccttcgctg cccgggggca 1 140 
tcctgcagcc tgtacaggag gacaacagca aaacatctcc cagatcccac agaagtgacg 1200 
gagagttttc cccccactct cattacagtg actctgatga ggccagttag gaaggcaaca 1260 
gctccctgaa aactgagaca accaaatgcc cttcctagcg cgcgggaagc cccgtgacaa 1320 
atatccctgc accctttaat ttttggtctg tggtgatcgt tgttagcaac gacttgactt 1380 
cggacggctg cag 1393 



<210> 46 

<211>351 

<212> PRT 

<213> Mus musculus 
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<400>46 

Met Ser Arg Leu Leu His Ala Glu Glu Tip Ala Glu Val Lys Glu Leu 
15 10 15 

Gly Asp His His Arg His Pro Gin Pro His His Val Pro Pro Leu Thr 
20 25 30 

Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Asp Leu Pro Val Tyr Pro 
35 40 45 

Ala Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp Leu Thr 
50 55 60 

Pro Thr Leu Gin Gly Leu Cys Thr Ala Arg Ala Ala Gin Tyr Leu Leu 
65 70 75 80 

His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg Asp Glu 
85 90 95 

Ala Asp Ser Gin Gly Glu Leu Val Arg Arg Ser Gly Cys Gly Gly Leu 
100 105 110 

Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu Cys Lys Leu 
115 120 125 

Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg Gin Arg Ala 
130 135 140 

Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg Arg Leu Ala 
145 150 155 160 

Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn His Ala Phe 
165 170 175 

Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp Lys Lys Leu 
180 185 190 

Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr He Asn Ala Leu 
195 200 205 

Ser Glu Leu Leu Gin Thr Pro Asn Val Gly Glu Gin Pro Pro Pro Pro 
210 215 220 

Thr Ala Ser Cys Lys Asn Asp His His His Leu Arg Thr Ala Ser Ser 
225 230 235 240 

Tyr Glu Gly Gly Ala Gly Ala Ser Ala Val Ala Gly Ala Gin Pro Ala 
245 250 255 

Pro Gly Gly Gly Pro Arg Pro Thr Pro Pro Gly Pro Cys Arg Thr Arg 
260 265 270 

Phe Ser Gly Pro Ala Ser Ser Gly Gly Tyr Ser Val Gin Leu Asp Ala 
275 280 285 

Leu His Phe Pro Ala Phe Glu Asp Arg Ala Leu Thr Ala Met Met Ala 
290 295 300 

Gin Lys Asp Leu Ser Pro Ser Leu Pro Gly Gly lie Leu Gin Pro Val 
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305 310 315 320 

Gin Glu Asp Asn Ser Lys Thr Ser Pro Arg Ser His Arg Ser Asp Gly 
325 330 335 

Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu Ala Ser 
340 345 350 



<210> 47 

<211>993 

<212> DNA 

<213> Mus musculus 

<400> 47 

atggcaaaaa tgtatatgaa atccaaggac atggtggagc tggtcaacac acaatcctgg 60 
atggacaaag gtctgagctc tcaaaatgag atgaaggagc aagagagaag accgggctct 120 
tatggaatgc tcggaacctt aactgaagag catgacagta ttgaggagga tgaagaagag 180 
gaagaagatg gagataaacc taaaagaaga ggtcccaaga aaaagaagat gactaaagct 240 
cgccttgaaa gattcagggc tcgaagagtc aaggccaatg ctagagaacg gacccggatg 300 
catggcctga atgatgcctt ggataatctt aggagagtca tgccatgtta ctctaaaact 360 
caaaagcttt ccaagataga gactcttcga ctggcaagga actacatctg ggccttgtct 420 
gaagtcctgg agactggtca gacacttgaa gggaagggat ttgtagagat gctatgtaaa 480 
ggtctctctc aacccacaag caacctggtt gctggatgcc tccaactggg gcctcaatct 540 
accctcctgg agaagcatga ggaaaaatct tcaatttgtg actctactat ctctgtccac 600 
agcttcaact atcagtctcc agggctcccc agccctcctt atggccatat ggaaacacat 660 
tctctccatc tcaagcctca accatttaag agtttgggtg actcttttgg gagccatcca 720 
cctgactgca gtaccccccc ttatgagggt ccactcacac cacccctgag cattagtggc 780 
aacttctcct taaagcaaga cggctcccct gatttggaaa aatcctacaa tttcatgcca 840 
cattatacct ctgcaagtct aagttcaggg catgtgcatt caactccctt tcagactggc 900 
actccccgct atgatgttcc tgtagacctg agctatgatt cctactccca ccatagcatt 960 
ggaactcagc tcaatacgat cttctctgat tag 993 



<210>48 _ 

<211>330 

<212> PRT 

<213> Mus musculus 

<400>48 

Met Ala Lys Met Tyr Met Lys Ser Lys Asp Met Val Glu Leu Val Asn 
1 5 10 15 

Thr Gin Ser Trp Met Asp Lys Gly Leu Ser Ser Gin Asn Glu Met Lys 
20 25 30 

Glu Gin Glu Arg Arg Pro Gly Ser Tyr Gly Met Leu Gly Thr Leu Thr 
35 40 45 

Glu Glu His Asp Ser lie Glu Glu Asp Glu Glu Glu Glu Glu Asp Gly 
50 55 60 

Asp Lys Pro Lys Arg Arg Gly Pro Lys Lys Lys Lys Met Thr Lys Ala 
65 70 75 80 

Arg Leu Glu Arg Phe Arg Ala Arg Arg Val Lys Ala Asn Ala Arg Glu 
85 90 95 

Arg Thr Arg Met His Gly Leu Asn Asp Ala Leu Asp Asn Leu Arg Arg 
100 105 110 
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Val Met Pro Cys Tyr Ser Lys Thr Gin Lys Leu Ser Lys lie 6lu Thr 
115 120 125 

Leu Arg Leu Ala Arg Asn Tyr lie Trp Ala Leu Ser Glu Val Leu Glu 
130 135 140 

Thr Gly Gin Thr Leu Glu Gly Lys Gly Phe Val Glu Met Leu Cys Lys 
145 150 155 . 160 

Gly Leu Ser Gin Pro Thr Ser Asn Leu Val Ala Gly Cys Leu Gin Leu 
165 170 175 

Gly Pro Gin Ser Thr Leu Leu Glu Lys His Glu Glu Lys Ser Ser lie 
180 185 190 

Cys Asp Ser Thr lie Ser Val His Ser Phe Asn Tyr Gin Ser Pro Gly 
195 200 205 

Leu Pro Ser Pro Pro Tyr Gly His Met Glu Thr His Ser Leu His Leu 
210 215 220 

Lys Pro Gin Pro Phe Lys Ser Leu Gly Asp Ser Phe Gly Ser His Pro 
225 230 235 240 

Pro Asp Cys Ser Thr Pro Pro Tyr Glu Gly Pro Leu Thr Pro Pro Leu 
245 250 255 

Ser lie Ser Gly Asn Phe Ser Leu Lys Gin Asp Gly Ser Pro Asp Leu 
260 265 270 

Glu Lys Ser Tyr Asn Phe Met Pro His Tyr Thr Ser Ala Ser Leu Ser 
275 280 285 

Ser Gly His Val His Ser Thr Pro Phe Gin Thr Gly Thr Pro Arg Tyr 
290 295 300 

Asp Val Pro Val Asp Leu Ser Tyr Asp Ser Tyr Ser His His Ser lie 
305 310 315 320 

Gly Thr Gin Leu Asn Thr lie Phe Ser Asp 
325 330 



<210> 49 
<211>2264 
<212> DNA 
<213> Frog 

<400> 49 

gacagtcagt agatgctcct ttcccaaagt gtatcttgca gtagccttac actaatactg 60 
tjctgtcttgt agctggagag ctttgtateg atgctaagag cactgggtat ctacaaagag 120 
gatcactgca tatgaatgga ataaggagtg ctgctgctac ccaggctggg gtttgttccg 180 
agcccttcaa aaccttttgg ccatagaatc actgtgttga catgaagtca gattcaccag 240 
tgcatgggga gtcccatact gaatgccagt caccatgccc actaagttgc atgccagcca 300 
ggctggaagg ctctaccaag agacgtctgg ctgccaatgc cagggaaaga aggagaatgc 360 
aaggactgaa taccgccttc gatagtctga ggaaagttgt accgcaatgg ggtgaggaca 420 
aaaaactttc caagtatgag actctacaga tggcactgag ctacatcatg gcactaagca 480 
ggatcctcac ggaagcagaa agatacagca gaactgatcc aggggaatgg actaaaatgc 540 
adttgatca cattcaggaa gaacagtgcc tcagttatat gggagtgaga tgcccaagag 600 
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actgtgatcg ctacctgccc cagacttttt ctcactagga taggagatgt gagcaacagt 660 
cagcaggcaa ggtactatag acctgaagat agcagtgtat tcctacacac agcagccaat 720 
aatacaggga catttgcatc atgggttatt tgtcatgtca ttctgcccaa tgcactgctt 780 
atttcattaa gcacccaaag tcccaggact gggaatatat gtagggcacc ccacgtgatg 840 
cagcccaaag tatgtgctgc tgcaactgat agtgagctgt gggacactgg aaaagcaaag 900 
tgcgctggta ttttgtaaat gaaaatgtca ttatgggtgg catataataa ttacttacac 960 
acagcacagt tatataattt cattgctgtt agaaagcccc tttgtctctt accccccatc 1020 
ctgctttcca tgttactaat gcactggtat cagtgaacat gtagaaggta gtatttatac 1 080 
attagaaaaa cactgaatgt gcataaacat catttttctc aatatacatt tactatatag 1 140 
ttcagttggt aagtgctggt aaacagcagc tgcccacaag ttctgtgcac tgctcatagg 1200 
ggaaaggaaa catttttgcc acttgctaga gaatgctaat tgcattgtac tctgtaatat 1260 
Igtcttgtgt ccagggtaag aaaggtctaa attaaaatca ataggaggca taatttacac 1320 
atttccctta agctgtaatt ttctggcttt tgtctgcatt ttaatagtgg aaaattacaa 1380 
cgtgtataaa attcctactg gtctcattct acttgtttct gtaagacaaa ccctggtgtg 1440 
tttttaagga aaccttaagt aattcttttg aatagtatac atttcttgaa tgtttttttt 1500 
ttttttttca atgggcatga gttgtttatt tgtaaacaca gctccaaagt taacagattg 1560 
cttgtatctt tcaggtatgc aaatggtatt ttagtctatt ctgggacttt gaaatagaaa 1620 
gcaatatgca accctttaga attagaaaat gtagtacaac ggaacaaatg gtgaatggct 1680 
tggcagagta ctgtacatac tgacctatgg agcatagttg gctaaatcag tctgcagttc 1740 
acttttactg gtgtgttgca gttagaataa taaaatcgga tgttgctaag gttgttacag 1800 
tatttctttg ccccagttaa tgcatgacct cccatcttac agagtttcta acacgttgcc 1860 
aatgtattgt tatctttcga aaaatgtgga tggaacgtga tcaggtgcaa attcatgcga 1920 
tttacctgtg cataattcct ataaaacaga gacaatgtgt agttatagga ggattcacta 1980 
cacggatgaa agatcttatt tacaacaggt taggcacaaa aagtgcagga ccatgaaaaa 2040 
ttgtggccct ttagatttac taaattctgg cacccagcca ttgtgatgct gcacctgcac 2100 
tcagcaccat attataatcc agcacaaggc agtaagtata aggctcccat actcttctca 2160 
ctgataagct tgctagagat gttcccaagg aagcccatgt gctgcccatc ttctgccctt 2220 
cctctaggca gcactgcctc acatgcggaa tgaaggccca agcg 2264 



<210> 50 
<211>138 
<212> PRT 
<213> Frog 

<400>50 

Met Lys Ser Asp Ser Pro Val His Gly Glu Ser His Thr Glu Cys Gin 
1 5 10 15 

Ser Pro Cys Pro Leu Ser Cys Met Pro Ala Arg Leu Glu Gly Ser Thr 
20 25 30 

Lys Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Gly 
35 40 45 

Leu Asn Thr Ala Phe Asp Ser Leu Arg Lys Val Val Pro Gin Trp Gly 
50 55 60 

Glu Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Leu Ser 
65 70 75 80 

Tyr lie Met Ala Leu Ser Arg lie Leu Thr Glu Ala Glu Arg Tyr Ser 
85 90 95 

Arg Thr Asp Pro Gly Glu Trp Thr Lys Met His Phe Asp His lie Gin 
100 105 110 

Glu Glu Gin Cys Leu Ser Tyr Met Gly Val Arg Cys Pro Arg Asp Cys 
115 120 125 
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Asp Arg Tyr Leu Pro Gin Thr Phe Ser His 
130 135 



<210> 51 
<211>2123 
<212> DNA 
<213> Frog 

<400> 51 

gacagtcagt agatgctcct tgcctgaagt gtatcttgca gtagccttac actgccttgt 60 
agctggagag ccttgtatca ctgctaacaa cactgggtat caacaaagaa gacaactgca 120 
taaaaatgaa ataaggagat actacccagg ctggggttta ttttgagtct ttcagaactt 180 
tctagggata gaatatctct gctgacatga agtcagattc accagtgcac agggagtccc 240 
atactggatg ccagtcacca tgcccactaa ggtgcttgcc agccaggctg gaaggctcta 300 
ccaagagacg tctggctgcc aatgccagag aaaggaggag aatgcaagga ctaaataccg 360 
ccttcgatag tctgaggaaa gttgtaccac aatggggtga agacaaacaa ctttccaaat 420 
atgagactct gcagatggcg ctgagctaca tcatggcact gagcaggatc ctctcggaag 480 
oagagaggta cagcaggact gatccagagg aatggactaa tattcaatat gatcacattg 540 
aggaggagca gtgcctcagt tatatggagg tgagatgccc aagagactgt gatcgttacc 600 
tgccccagac tttttctcac taggataaga gcaggcaagg tactactgac ctgaagacag 660 
cactgtttta atataatggg tcggttatac agcacccaat gatacaggga catttgcatc 720 
atgggctatt tgtcatgttg tttttcccaa tgcaatgctt atttccttaa gcaccctacg 780 
ttcaggactg ggtacatatg tagggaaccc caagtgatgc agcccagagg atgcggtgct 840 
gcaacggatg gcagttagtg agctgtggaa cactggaaaa gccaagtgca ctggtatttt 900 
gtgaaaggac atgcaagtta ttatgggtgg catataatat ttacctccat acagcacagt 960 
gatataactt cattgcctcc atcatacctg tgtgattata tataaaatgg tagttcctga 1020 
gtcaclactt tccatgttac ttatgcactg ttatcagata acatagagaa agtagtattt 1080 
atacattaga aaaagtacta tatgtgcata taaattggcg tttaaagcag tctggataaa 1 140 
tcattttctg tggactcttt tcttacccca aggagccatt tatagtttaa gtgctgctat 1200 
aggcactgct cataagggaa aggaaacatt tttgtcactt gctgataata caaattgcat 1260 
tclacaccag aattcttaat taactatact gtctagtgaa cagaaaggtc taaattaaaa 1320 
tcaacaagag gtataattta catattttac cattttctgg cttctgtctg ccttttgaga 1380 
gtggcaaatt acaacgtata taaaattcct acaggtctca atctacttgt tgctgtaaag 1440 
acaaaccttg gtgtattttt aaggaaacct taagtaattc ttttggttag tatacatttc 1500 
ttaaatattt ttatttgtaa tgggcgtcaa ttgtttattt gtaaacgcag ctccaaaaat 1560 
agcagatatg cttgtatttt tcaggtatgc aaatggtatt tcagcctata ttgggaaagc 1620 
agtaagcaaa tctttgtaat tgaaattgta gtaaaactga acatatggtg aattgcttgg 1680 
caatgtactg tatacatgct gacctataag ggctatgtag caaagttggg aaaaagtttt 1740 
ccctatcaaa atcactgcag gtttattggt ttgttgcaga tagaataata aaatctgaca 1800 
ttgctgatgt gggtaaggta ttttattccc catttaatat cttaatcttt cttcttacac 1 860 
tgaaaacatt actagcccct tgccaatgta ttcttatctg tcaaattcag gagttcatga 1920 
gacttacctg tacataatcc ctatgaaacc aagacaatgt gtggggaata agaggatttg 1980 
ctgcacaaat tagggcagag atacatggtc tgactgtgac aaatctcctc ttcttcggag 2040 
cgtttaatct ccccaaactg cctttccagg cactttgttc tccgaagtcg cccgaagttg 2100 
cctaacgagg caacttcagc acg 2123 



<210>52 
<211>138 
<212> PRT 
<213>Frog 

<400> 52 

Met Lys Ser Asp Ser Pro Val His Arg Glu Ser His Thr Gly Cys Gin 
1 5 10 15 

Ser Pro Cys Pro Leu Arg Cys Leu Pro Ala Arg Leu Glu Gly Ser Thr 
20 25 30 
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Lys Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Gly 
35 40 45 

Leu Asn Thr Ala Phe Asp Ser Leu Arg Lys Val Val Pro Gin Trp Gly 
50 55 60 

Glu Asp Lys Gin Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Leu Ser 
65 70 75 80 

Tyr lie Met Ala Leu Ser Arg lie Leu Ser Glu Ala Glu Arg Tyr Ser 
85 90 95 

Arg Thr Asp Pro Glu Glu Trp Thr Asn lie Gin Tyr Asp His He Glu 
100 105 110 

Glu Glu Gin Cys Leu Ser Tyr Met Glu Val Arg Cys Pro Arg Asp Cys 
115 120 125 

Asp Arg Tyr Leu Pro Gin Thr Phe Ser His 
130 135 



<210>53 
<211>136 
<212>DNA 
<213> Beetle 

<400> 53 

gcggcgaatg cgcgcgagcg gcggcggatg aacggcctga atgaagcttt cgatcggcta 60 
agacaagtta taccaagctt ggacgctgac cacaaattga gcaagtttga gactctgcag 120 
atggcccaga cctaca 136 



<210>54 
<211>45 
<212> PRT 
<213> Beetle 

<400> 54 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Asn Gly Leu Asn Glu Ala 
15 10 15 

Phe Aso Arg Leu Arg Gin Val lie Pro Ser Leu Asp Ala Asp His Lys 
20 25 30 

Leu Ser Lys Phe Glu Thr Leu Gin Met Ala Gin Thr Tyr 
35 40 45 



<210>55 
<211>137 
<212> DNA 
<213> Beetle 

<400> 55 

gcggcgaatg cgagggagag gcggaggatg aacagtttga atgacgcctt cgacaggctg 60 
cgggacgtgg tgccgtccct tgggaacgat cggaagctgt ccaagtttga gacacttcag 120 
atggcccaga cctacat 137 
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<210> 56 
<211>45 
<212> PRT 
<213> Beetle 

<400> 56 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Asn Ser Leu Asn Asp Ala 
1 5 10 15 

Phe Asp Arg Leu Arg Asp Val Val Pro Ser Leu Gly Asn Asp Arg Lys 
20 25 30 

Leu Ser Lys Phe Glu Thr Leu Gin Met Ala Gin Thr Tyr 
35 40 45 



<210> 57 
<211>1572 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> modified_base 
<222> (1497)..{1564) 
<223> N = any nucleotide 

<400> 57 

gtcctctgca cacaagaact tttctcgggg tgtaaaaact ctttgattgg ctgctcgcac 60 
gcgcctgccc gcgccctcx;a ttggctgaga agacacgcga ccggcgcgag gagggggttg 120 
ggagaggagc ggggggagac tgagtggcgc gtgccgcttt ttaaaggggc gcagcgcctt 180 
cagcaaccgg agaagcatag ttgcacgcga cctggtgtgt gatctccgag tgggtggggg 240 
agggtcgagg agggaaaaaa aaataagacg ttgcagaaga gacccggaaa gggccttttt 300 
*ttggttgag ctggtgtccc agtgctgcct ccgatcctga gcgtccgagc ctttgcagtg 360 
caatgtcccg cctgctgcat gcagaagagt gggctgaagt gaaggagttg ggagaccacc 420 
atcgccagcc ccagccgcat catctcccgc aaccgccgcc gccgccgcag ccacctgcaa 480 
ctttgcaggc gagagagcat cccgtctacc cgcctgagct gtccctcctg gacagcaccg 540 
acccacgcgc ctggctggct cccactttgc agggcatctg cacggcacgc gccgcccagt 600 
atttgctaca ttccccggag ctgggtgcct cagaggccgc tgcgccccgg gacgaggtgg 660 
acggccgggg ggagctggta aggaggagca gcggcggtgc cagcagcagc aagagccccg 720 
ggccggtgaa agtgcgggaa cagctgtgca agctgaaagg cggggtggtg gtagacgagc 780 
tgggctgcag ccgccaacgg gccccttcca gcaaacaggt gaatggggtg cagaagcaga 840 
gacggctagc agccaacgcc agggagcggc gcaggatgca tgggctgaac cacgccttcg 900 
accagctgcg caatgttatc ccgtcgttca acaacgacaa gaagctgtcc aaatatgaga 960 
ccctgcagat ggcccaaatc tacatcaacg ccttgtccga gctgctacaa acgcccagcg 1020 
gaggggaaca gccaccgccg cctccagcct cctgcaaaag cgaccaccac caccttcgca 1080 
ccgcggcctc ctatgaaggg ggcgcgggca acgcgaccgc agctggggct cagcaggctt 1 140 
ccggagggag ccagcggccg accccgcccg ggagttgccg gactcgcttc tcagccccag 1200 
cttctgcggg agggtactcg gtgcagctgg acgctctgca cttctcgact ttcgaggaca 1260 
gcgccctgac agcgatgatg gcgcaaaaga atttgtctcc ttctctcccc gggagcatct 1 320 
tgcagccagt gcaggaggaa aacagcaaaa cttcgcctcg gtcccacaga agcgacgggg 1380 
aattttcccc ccattcccat tacagtgact cggatgaggc aagttaggaa ggtgacagaa 1440 
gcctgaaaac tgagacagaa acaaaactgc cctttcccag tgcgcgggaa gccccgnggt 1 500 
taangatccc cgcacccttt aatttnggct ctgcgatggt cgttgtttag caacgacttg 1560 
gctncagatggt 1572 



<210> 58 

<211>354 

<212> PRT 

<213> Homo sapiens 
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<400>58 

Met Ser Arg Leu Leu His Ala Glu Glu Trp Ala Glu Val Lys Glu Leu 
15 10 15 

Gly Asp His His Arg Gin Pro Gin Pro His His Leu Pro Gin Pro Pro 

20 25 30 

Pro Pro Pro Gin Pro Pro Ala Thr Leu Gin Ala Arg Glu His Pro Val 
35 40 45 

Tyr Pro Pro Glu Leu Ser Leu Leu Asp Ser Thr Asp Pro Arg Ala Trp 
50 55 60 

Leu Ala Pro Thr Leu Gin Gly lie Cys Thr Ala Arg Ala Ala Gin Tyr 
65 70 75 80 

Leu Leu His Ser Pro Glu Leu Gly Ala Ser Glu Ala Ala Ala Pro Arg 
85 90 95 

Asp Glu Val Asp Gly Arg Gly Glu Leu Val Arg Arg Ser Ser Gly Gly 
100 105 110 

Ala Ser Ser Ser Lys Ser Pro Gly Pro Val Lys Val Arg Glu Gin Leu 
115 120 125 

Cys Lys Leu Lys Gly Gly Val Val Val Asp Glu Leu Gly Cys Ser Arg 
130 135 140 

Gin Arg Ala Pro Ser Ser Lys Gin Val Asn Gly Val Gin Lys Gin Arg 
145 150 155 160 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
165 170 175 

His Ala Phe Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp 
180 185 190 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin Met Ala Gin lie Tyr lie 
195 200 205 

Asn Ala Leu Ser Glu Leu Leu Gin Thr Pro Ser Gly Gly Glu Gin Pro 
210 215 220 

Pro Pro Pro Pro Ala Ser Cys Lys Ser Asp His His His Leu Arg Thr 
225 230 235 240 

Ala Ala Ser Tyr Glu Gly Gly Ala Gly Asn Ala Thr Ala Ala Gly Ala 
245 250 255 

Gin Gin Ala Ser Gly Gly Ser Gin Arg Pro Thr Pro Pro Gly Ser Cys 
260 265 270 

Arg Thr Arg Phe Ser Ala Pro Ala Ser Ala Gly Gly Tyr Ser Val Gin 
275 280 285 

Leu Asp Ala Leu His Phe Ser Thr Phe Glu Asp Ser Ala Leu Thr Ala 
290 295 300 

Met Met Ala Gin Lys Asn Leu Ser Pro Ser Leu Pro Gly Ser lie Leu 
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305 310 315 320 

Gin Pro Val Gin Glu Glu Asn Ser Lys Thr Ser Pro Arg Ser His Arg 
325 330 335 

Ser Asp Gly Glu Phe Ser Pro His Ser His Tyr Ser Asp Ser Asp Glu 
340 345 350 

Ala Ser 



<210> 59 
<211>485 
<212> DNA 
<213> chicken 

<220> 

<221> modified_base 
<222> (147) 

<223> N = any nucleotide 
<400> 59 

ccgctgctgg ggccggacgg ggcggctgcg gcttcgcccc cggctggctg ggcgtgtgct 60 
gcgccgcacg cgtgcccgcc gcgtcgccgc gctacctgct gcccgccgac gaggaggacg 120 
aggcggcccg tggcgggggg cgcggcncgc gttccggcgg gagcagcccc gggggagcgc 180 
ggggcggcgg cgggcgcgcg gggcggcggc ggcggggccg ggccgcgggc gcaggtgagc 240 
ggcgtgcaga agcagcggcg gctggcggcc aacgcgcggg agcggcggcg gatgcacggg 300 
ctgaaccacg ccttcgacca gctgcgtaat gtcatcccct ccttcaacaa cgacaagaag 360 
ctctccaagt acgagacgct gcagatggcg caaatctaca tcagcgccct cgccgagctg 420 
ctgcacgggc cgcccgcgcc ccccgagccg cccgccaagg ccgagctccg cggggccccc 480 
ttcga 485 



<210>60 . ... 

<211>161 
<212> PRT 
<213> chicken 

<400> 60 

Pro Leu Leu Gly Pro Asp Gly Ala Ala Ala Ala Ser Pro Pro Ala Gly 
15 10 15 

Trp Ala Cys Ala Ala Pro His Ala Cys Pro Pro Arg Arg Arg Ala Thr 
20 25 30 

Cys Cys Pro Pro Thr Arg Arg Thr Arg Arg Pro Val Ala Gly Gly Ala 
35 40 45 

Ala Arg Val Pro Ala Gly Ala Ala Pro Gly Glu Arg Gly Ala Ala Ala 
50 55 60 

Gly Ala Arg Gly Gly Gly Gly Gly Ala Gly Pro Arg Ala Gin Val Ser 
65 70 75 80 

Gly Val Gin Lys Gin Arg Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg 
85 90 95 

Arg Met His Gly Leu Asn His Ala Phe Asp Gin Leu Arg Asn Val lie 
100 105 110 
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Pro Ser Phe Asn Asn Asp Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin 
115 120 125 

Met Ala Gin lie Tyr lie Ser Ala Leu Ala Glu Leu Leu His Gly Pro 
130 135 140 

Pro Ala Pro Pro Glu Pro Pro Ala Lys Ala Glu Leu Arg Gly Ala Pro 
145 150 155 160 

Phe 



<21Q> 61 
<211>138 
<212> DNA 
<213> Puffer Fish 

<400> 61 

gcggcgaacg cgagggagag gaggagaatg cacggcctga ataaagcgtt tgacgaactg 60 
aggagcgtca ttccttccct ggaaaatgag agaaagctct ccaagtatga cactctccag 120 
atggcccaaa cctacatc 1 38 



<210> 62 
<211>46 
<212> PRT 
<213> Puffer Fish 

<400> 62 

Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn Lys Ala 
1 5 10 15 

Phe Asp Glu Leu Arg Ser Val lie Pro Ser Leu Glu Asn Glu Arg Lys 
20 25 30 

Leu Ser Lys Tyr Asp Thr Leu Gin Met Ala Gin Thr Tyr He 
35 40 45 



<210> 63 
<211>1477 
<212> DNA 
<213> Drosophila 

<400> 63 

atcatcttgt tagcggcttt agagccgaat cgttttctag cgccatttta agctcgcaac 60 
gaactgaggt ataaccgggc tctctgagac cgctgcaact caccaccaac tgccattggt 120 
cgtgccactc gggcggcacg tgctgccttc tgtggcaact cgtttacctg cccccctacx 180 
tgcctttcag gcccttctga ccgtcgtggt ggatttgtga gtataaatag ggccgaaagg 240 
acgagagacc agtcagaaac ccgccagcac tcgcagcgtt cgtatcgttt catccagcaa 300 
cataacacca ccatacagca gcagcaacat gtcgtccagt gagatctatc gctactacta 360 
uaagacctcc gaggacttgc agggcttcaa gacagccgcc gccgagccgt acttcaatcc 420 
catggcagcc tacaatcccg gcgtgacxxa ctaccagttc aatggcaaca ccctggccag 480 
cagcagcaac tacttgtcgg ccaatggctt catcagcttc gagcaggcca gttccgatgg 540 
ctggatctcc tcctcgccgg ctagccaccg atctgagagt cccgagtatg tggatctcaa 600 
taccatgtac aatggaggct gcaacaacat ggcccagaac caacaatacg gaatgattat 660 
ggagcagtct gttgtttcca cagcgcctgc aattccagtg gcctctcctc cggcagtgga 720 
ggtcatgggc tcctccaacg tgggcacttg caaaacgatt ccagcctcag cagctccgaa 780 
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accgaagcgt agctatacca agaagaacca gccaagcacc accgccacct ccacaccgac 840 
tgcagctgcg gagtcatctg cctcagtgaa tctctacacg gaggagttcc agaactttga 900 
ctttgacaac tccgccttgt tcgatgacag cgtcgaggat gacgaggacc tcatgctctt 960 
cagtggcggt gaggacttcg atggcaatga tggatccttt gacttggccg atggtgagaa 1020 
ccaagatgcc gctgccggag gctctggaaa gaagaggcgt ggcaagcaga tcacacccgt 1080 
cgtgaagagg aagcgtcgcc tggccgccaa tgcacgtgag cgtcgtcgga tgcagaacct 1140 
caaccaggcc ttcgatcgtc tccgccagta ccttccctgt ctgggaaacg atcgccagct 1200 
gtccaaacac gagaccctcc aaatggccca gacctacata tccgctctcg gggatctgct 1260 
gcgctgaatt cccggatccc gatcccagtc ccaagtacta ttctcagtta ttgttggagc 1320 
ttgccaaatg ttgtagctac tttgtatata ttgcctggag cccagtagtg aattaccgct 1380 
taagtattat gctgtttatt gtttagttaa ttagcctaaa tggaagacaa tgattaagac 1440 
taaggaagac aaaataaaag caccattaat aatttaa 1477 



<210>64 
<211>312 
<212> PRT 
<213> Drosophila 

<400> 64 

Met Ser Ser Ser Glu He Tyr Arg Tyr Tyr Tyr Lys Thr Ser Glu Asp 
1 5 10 15 

Leu Gin Gly Phe Lys Thr Ala Ala Ala Glu Pro Tyr Phe Asn Pro Met 
20 25 30 

Ala Ala Tyr Asn Pro Gly Val Thr His Tyr Gin Phe Asn Gly Asn Thr 
35 40 45 

Leu Ala Ser Ser Ser Asn Tyr Leu Ser Ala Asn Gly Phe lie Ser Phe 
50 55 60 

Glu Gin Ala Ser Ser Asp Gly Trp He Ser Ser Ser Pro Ala Ser His 
65 70 75 80 

Arg Ser Glu Ser Pro Glu Tyr Val Asp Leu Asn Thr Met Tyr Asn Gly 
85 90 95 

Gly Cys Asn Asn Met Ala Gin Asn Gin Gin Tyr Gly Met lie Met Glu 
100 105 110 

Gin Ser Val Val Ser Thr Ala Pro Ala lie Pro Val Ala Ser Pro Pro 
115 120 125 

Ala Val Glu Val Met Gly Ser Ser Asn Val Gly Thr Cys Lys Thr He 
130 135 140 

Pro Ala Ser Ala Ala Pro Lys Pro Lys Arg Ser Tyr Thr Lys Lys Asn 
145 150 155 160 

Gin Pro Ser Thr Thr Ala Thr Ser Thr Pro Thr Ala Ala Ala Glu Ser 
165 170 175 

Ser Ala Ser Val Asn Leu Tyr Thr Glu Glu Phe Gin Asn Phe Asp Phe 
180 185 190 

Asp Asn Ser Ala Leu Phe Asp Asp Ser Val Glu Asp Asp Glu Asp Leu 
195 200 205 

Met Leu Phe Ser Gly Gly Glu Asp Phe Asp Gly Asn Asp Gly Ser Phe 
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210 215 220 

Asp Leu Ala Asp Gly Glu Asn Gin Asp Ala Ala Ala Gly Gly Ser Gly 
225 230 235 240 

Lys Lys Arg Arg Gly Lys Gin lie Thr Pro Val Val Lys Arg Lys Arg 
245 250 255 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met Gin Asn Leu Asn 
260 265 270 

Gin Ala Phe Asp Arg Leu Arg Gin Tyr Leu Pro Cys Leu Gly Asn Asp 
275 280 285 

Arg Gin Leu Ser Lys His Glu Thr Leu Gin Met Ala Gin Thr Tyr He 
290 295 300 

Ser Ala Leu Gly Asp Leu Leu Arg 
305 310 



<210> 65 
<211>907 
<212> DNA 
<213> Frog 

<400> 65 

gccccggggc cactctgcgc acttgtcggg acttattcgc acttacctgt catggcccgt 60 
ctgctacacg gcgctgctac tgccgctgac tggtgcgagc tgaaggagct tccatccgag 1 20 
gccgggctct tggccagaga ttacctacta gacagcagcg acccccgcgc ctggctctcc 180 
gccacttccc tgcaaagtcg ccctgagtac gtgctgcacc ccccgggccg ggccgggcgc 240 
acaaggtgcg ggaactgtgc aaactgaagg ggctgcggga tgatgatgat gatgaggagg 300 
atgatgagga ggaggaagag agatccgagg ggctgtgcag acacaggggt ccccctggca 360 
agggccxjtgg tggggttcag aagcagagga gactggcagc caatgccagg gagaggagga 420 
ggatgcacgg gctcaatcat gccttcgatc agctccgtaa tgtcatccct tccttcaata 480 
acgacaagaa actctccaaa tacgagaccc tgcagatggc tcagatctac atcaacgccc 540 
tgtccgacct gctgcaggcg ccccccgact ccagagatcc cccctgcccg cccacctacc 600 
aactgcattc ggggccagag cccaggttag tccagtctgg cagcatgaga ttctcggaga 660 
cttccccccg acagtccccc ctcagccaat tccaggaggg agctgctccc agaagggaat 720 
aggatctggg cccatcttca tcttctcggg aagacatcgc ccatcttcat cttcggggag 780 
aagacagcaa gacatcgcaa gatctcatcg gagtgacggc gaattccggt ctccctatag 840 
tgagtcgtat taatttcgat aagccagctg cattaatgaa tcggccaaac gcgcggggag 900 
aggcggt 907 



<210> 66 
<211>259 
<212> PRT 
<213> Frog 

<400> 66 

Met Ala Arg Leu Leu His Gly Ala Ala Thr Ala Ala Asp Trp Cys Glu 
15 10 15 

Leu Lys Glu Leu Pro Ser Glu Ala Gly Leu Leu Ala Arg Asp Tyr Leu 
20 25 30 

Leu Asp Ser Ser Asp Pro Arg Ala Trp Leu Ser Ala Thr Ser Leu Girt 
35 40 45 
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Ser Arg Pro Glu Tyr Val Leu His Pro Pro Gly Arg Ala His Lys Val 
50 55 60 

Arg Glu Leu Cys Lys Leu Lys Gly Leu Arg Asp Asp Asp Asp Asp Glu 
65 70 75 80 

Glu Glu Asp Asp Glu Glu Glu Glu Glu Arg Ser Glu Gly Leu Cys Arg 
85 90 95 

His Arg Gly Pro Pro Gly Lys Gly Pro Gly Gly Val Gin Lys Gin Arg 
100 105 110 

Arg Leu Ala Ala Asn Ala Arg Glu Arg Arg Arg Met His Gly Leu Asn 
115 120 125 

His Ala Phe Asp Gin Leu Arg Asn Val lie Pro Ser Phe Asn Asn Asp 
130 135 140 

Lys Lys Leu Ser Lys Tyr Glu Thr Leu Gin IVIet Ala Gin lie Tyr He 
145 150 155 160 

Asn Ala Leu Ser Asp Leu Leu Gin Ala Pro Pro Asp Ser Arg Asp Pro 
165 170 175 

Pro Cys Pro Pro Thr Tyr Gin Leu His Ser Gly Pro Glu Pro Arg Leu 
180 185 190 

Val Gin Ser Gly Ser Cys Met Arg Phe Ser Gly Asp Phe Pro Gly Gin 
195 200 205 

Ser Pro Leu Ser Phe Gin Phe Gin Glu Gly Ala Ala Leu Ser Gly Lys 
210 215 220 

Gly lie Gly Ser Ala Pro Ser Ser Ser Ser Gly Glu Asp Ser Lys Thr 
225 230 235 240 

Ser Pro Arg Ser His Arg Ser Asp Gly Glu Phe Arg Ser Pro Tyr Ser 
245 250 255 

Glu Ser Tyr 



<210> 67 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
Construct 

<400> 67 

tgaagctttt ggctttgag 19 



<210>68 

<211>19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
Construct 

<400> 68 

ccgctgccaa attctttgg 19 



<210> 69 

<211>37 

<212> DNA 

<213> Homo sapiens 

<400> 69 

gggggcactg acagtaatgc atgccgtatt cgaagtt 37 
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